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CROSS-REFERENCES TO RELATED APPLICATIONS 
[0001] This application is a U.S. National Phase Application, filed under 35 U.S.C. §371 of 
5 Patent Cooperation Treaty Application Number PCT/US2005/00302, filed January 6, 2005, 
and claims priority to U.S. Provisional Application Number 60/535,263, filed January 9, 
2004. Each of the aforementioned applications are hereby incorporated by reference in their 
entirety and for all purposes. 

STATEMENT AS TO RIGHTS TO INVENTIONS MADE UNDER 
10 FEDERALLY SPONSORED RESEARCH OR DEVELOPMENT 

[0002] NOT APPLICABLE 

REFERENCE TO A "SEQUENCE LISTING," A TABLE, OR A COMPUTER 
PROGRAM LISTING APPENDIX SUBMITTED ON A COMPACT DISK. 
1 5 [0003] NOT APPLICABLE 

BACKGROUND OF THE INVENTION 
[0004] Humans have exploited the use of genetics and microorganisms for their own 
advantage throughout much of recorded history. Egyptians are credited with the first use of 
yeast to produce leavened bread sometime between 4000-2000 BC, Gregor Mendel produced 

20 peas having specific, defined, characteristics in the mid- 19th century, and the Food and Drug 
Administration approved the first recombinant drug, human insulin, in 1982. This last feat is 
often considered to be the beginning of the modern biotechnology industry. Since then, 
transgenic plants, recombinant foods, recombinant vaccines, cancer therapeutics, recombinant 
antibodies, enzymes, glycosyltransferases, cytokines, coagulation factors, hormones, dermal 

25 replacements, anti-virals, and many other recombinant proteins have been developed for 
human use. 

[0005] The nucleic acid expression vector has greatly aided in the production of 
recombinant proteins and therapeutics. A nucleic acid encoding a protein reagent or 
therapeutic protein can be cloned into an expression vector, which can be expressed in a 
30 population of eukaryotic and/or prokaryotic cells, thus producing a large amount of a 
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recombinant protein or therapeutic. However, the yield and quality of the recombinant 
product depend greatly on the expression vector and microorganism used to express the 
vector. In addition, the use of recombinant cells can be slow and tax resources that can be 
otherwise used for discovery and improvement of recombinant proteins and therapeutics. 

5 [0006] As the demand and usefulness of recombinant proteins increases, new methods are 
required in order to more efficiently prepare such proteins with a rapid turnaround time. 
Moreover, as recombinant proteins for the treatment of a variety of diseases are generated, 
methods to lower the cost of their production need to be implemented so that these 
technologies are available to all those in need. The need to provide improved vectors for 
10 protein expression does not exist solely in the therapeutic protein arena. Rather, this need also 
extends to the production of proteins or reagents(e.g., enzymes) for use in the production of 
both protein and non-protein therapeutics. 

[0007] Over the past several decades, recombinant proteins and therapeutics have proven to 
be the answer in treating many diseases that were not addressed using conventional, chemical 
1 5 therapeutics. However, recombinant technology has been hampered by inefficiency, 
especially in small scale situations, as well as high cost and slow turnaround time. 
Streamlining the expression of proteins at a lower cost with a quicker turnaround time for 
virtually any customer situation is needed to realize the potential of recombinant proteins as 
reagents and as therapeutics. The present invention meets this need. 

20 BRIEF SUMMARY OF THE INVENTION 

[0008] The invention includes a method of providing a therapeutic protein to a customer, 
comprising cloning a nucleic acid encoding a protein into a pCWinl expression vector as set 
forth in SEQ ID NO:l, expressing a protein therefrom, and providing the protein to a 
customer. 

25 [0009] In another aspect of the invention, a method of providing a therapeutic protein to a 
customer comprises cloning a nucleic acid encoding a protein into a pCWin2 expression 
vector as set forth in SEQ ID NO:2, expressing a protein therefrom, and providing the protein 
to a customer. 

[0010] In yet another aspect of the invention, a method of providing a therapeutic protein to 
30 a customer comprises cloning a nucleic acid encoding a protein into a pCWin2/MBP 

expression vector as set forth in SEQ ID NO:3, expressing a protein therefrom, and providing 
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the protein to a customer. In still another aspect, a method of providing a therapeutic protein 
to a customer comprises cloning a nucleic acid encoding a protein into a pCWin2-MBP-SBD 
(pMS 39 ) expression vector as set forth in SEQ ID NO: 10, expressing a protein therefrom, and 
providing the protein to a customer. In yet another aspect of the invention, a method of 
5 providing a therapeutic protein to a customer comprises cloning a nucleic acid encoding a 
protein into a pCWin2-MBP-MCS-SBD (pMXS 3 9) expression vector as set forth in SEQ ID 
NO: 1 1 , expressing a protein therefrom, and providing the protein to a customer. 

[0011] In an embodiment of the invention, a pCWIN2/MBP vector comprises a protease , 
cleavage site coding sequence between the MBP coding sequence and the therapeutic protein 
1 0 coding sequence. 

[0012] Therapeutic proteins useful in the present invention include erythropoietin, human 
growth hormone, granulocyte colony stimulating factor, interferons alpha, -beta, and - 
gamma, Factor IX, follicle stimulating hormone, interleukin-2, erythropoietin, anti-TNF- 
alpha, and lysosomal hydrolases such as beta-glucosidase, alpha-galactosidase-A, beta- 
15 hexosaminidase, beta- gal actosidase, alpha-galactosidase, alpha-mannosidase, beta- 
mannosidase, alpha- L-fucosidase, beta-glucuronidase, alpha-glucosidase, alpha-N- 
acetylgalactosaminidase, and acid phosphatase. 

[0013] In one embodiment of the invention, a method of providing a protein to a customer 
includes cloning a nucleic acid encoding a protein into a pCWinl expression vector as set 
20 forth in SEQ ID NO: 1 , expressing a protein therefrom, and providing the protein to a 
customer. 

[0014] In another embodiment of the invention, a method of providing a protein to a 
customer includes cloning a nucleic acid encoding a protein into a pCWin2 expression vector 
as set forth in SEQ ID NO:2, expressing a protein therefrom, and providing the protein to a 
25 customer. 

[0015] In another embodiment of the invention, a method of providing a protein to a 
customer includes cloning a nucleic acid encoding a protein into a pCWin2/MBP expression 
vector as set forth in SEQ ID NO:3, expressing a protein therefrom, and providing the protein 
to a customer. In still another aspect, a method of providing a protein to a customer comprises 
30 cloning a nucleic acid encoding a protein into a pCWin2-MBP-SBD (pMS 39 ) expression 
vector as set forth in SEQ ID NO: 10, expressing a protein therefrom, and providing the 
protein to a customer. In yet another aspect of the invention, a method of providing a protein 
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to a customer comprises cloning a nucleic acid encoding a protein into a pCWin2-MBP- 
MCS-SBD (PMXS39) expression vector as set forth in SEQ ID NO:l 1, expressing a protein 
therefrom, and providing the protein to a customer. 

[0016] In one aspect of the invention, a protein may be a glycosyltransferase or a sugar 
5 nucleotide-generating enzyme. 

[0017] In an aspect of the invention, an expression vector includes an affinity tag coding 
sequence. In this aspect of the invention, an affinity tag may be a histidine tag, a Factor IX 
tag, a glutathione-S-transferase tag, starch-binding domain and a FLAG-tag. 

[0018] The invention includes a method of providing a therapeutic protein to a customer, 
1 0 where the method includes providing an expression vector to a protein production facility 

wherein a nucleic acid encoding a protein is cloned into the expression vector and the protein 
is expressed therefrom in the protein production facility; subsequently providing the protein 
to a customer. In an aspect of the invention, the expression vector comprises a multiple- 
cloning region and an antibiotic resistance marker. The antibiotic resistance marker may be 
15 kanamycin, tetracycline, or chloramphenicol. In another aspect of the invention, the 
expression vector includes an affinity tag. 

[0019] In an embodiment of the invention, a method of providing a protein to a customer 
includes providing a pCWinl vector as set forth in SEQ ID NO:l to a protein production 
facility, wherein a nucleic acid encoding a protein is cloned into the expression vector and the 
20 protein is expressed therefrom in the protein production facility, and the protein is provided 
to a customer. 

[0020] In an embodiment of the invention, a method of providing a protein to a customer 
includes providing a pCWin2 vector as set forth in SEQ ID NO:2 to a protein production 
facility, wherein a nucleic acid encoding a protein is cloned into the expression vector and the 
25 protein is expressed therefrom in the protein production facility, and the protein is provided 
to a customer. 

[0021] In an embodiment of the invention, a method of providing a protein to a customer 
includes providing a pCWin2/MBP vector as set forth in SEQ ID NO:3 to a protein 
production facility, wherein a nucleic acid encoding a protein is cloned into the expression 
30 vector and the protein is expressed therefrom in the protein production facility, and the 
protein is provided to a customer. In another embodiment of the invention, a method of 
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providing a protein to a customer includes providing a pCWin2-MBP-SBD (pMS 39 ) vector as 
set forth in SEQ ID NO: 10 to a protein production facility, wherein a nucleic acid encoding a 
protein is cloned into the expression vector and the protein is expressed therefrom in the 
protein production facility, and the protein is provided to a customer. In still another 
5 embodiment of the invention, a method of providing a protein to a customer includes 

providing a pCWin2-MBP-MCS-SBD (pMXS 39 ) vector as set forth in SEQ ID NO:l 1 to a 
protein production facility, wherein a nucleic acid encoding a protein is cloned into the 
expression vector and the protein is expressed therefrom in the protein production facility, 
and the protein is provided to a customer. 

10 [0022] In one aspect of the invention, a protein production facility is in-house. In another 
aspect of the invention, the protein production facility is offsite. 

[0023] The invention also includes a method of providing a protein to a customer, wherein 
at least one glycosyl moiety is added to a protein prior to providing the protein to a customer. 
In one aspect, a glycosyl moiety is added to a protein in vitro. 

1 5 [0024] The present invention includes a method of providing a protein to a customer, 

comprising cloning a nucleic acid encoding said protein into a pCWinl expression vector as 
set forth in SEQ ID NO:l, inserting the vector into a bacterial host cell, expressing the protein 
in the host cell, and providing the protein to a customer. 

[0025] Another embodiment of the invention includes a method of providing a protein to a 
20 customer, comprising cloning a nucleic acid encoding said protein into a pCWin2 expression 
vector as set forth in SEQ ID NO:2, inserting the vector into a bacterial host cell, expressing 
the protein in the host cell, and providing the protein to a customer. 

[0026] Another embodiment of the invention includes a method of providing a protein to a 
customer, comprising cloning a nucleic acid encoding said protein into a pCWin2/MBP 

25 expression vector as set forth in SEQ ID NO:3, inserting the vector into a bacterial host cell, 
expressing the protein in the host cell, and providing the protein to a customer. Yet another 
embodiment of the invention includes a method of providing a protein to a customer, 
comprising cloning a nucleic acid encoding said protein into a pCWin2-MBP-SBD (pMS 39 ) 
expression vector as set forth in SEQ ID NO: 10, inserting the vector into a bacterial host cell, 

30 expressing the protein in the host cell, and providing the protein to a customer. Still another 
embodiment of the invention includes a method of providing a protein to a customer, 
comprising cloning a nucleic acid encoding said protein into a pCWin2-MBP-MCS-SBD 
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(PMXS39) expression vector as set forth in SEQ ID NO:l 1, inserting the vector into a 
bacterial host cell, expressing the protein in the host cell, and providing the protein to a 
customer. 

[0027] In one embodiment of the invention, a method includes adding at least one glycosyl 
5 moiety to a protein prior to providing the protein to a customer. In one aspect, a glycosyl 
moiety is added to a protein in vitro. 

[0028] The invention features isolated pcWINl expression vector comprising the sequence 
set forth in SEQ ID NO:l. The invention also features an isolated pcWINl expression vector 
consisting of the sequence set forth in SEQ ID NO:l. 

10 [0029] In another aspect, the invention features an expression vector comprising the 
sequence set forth in SEQ ID NO:2. The invention also features an isolated pcWIN2 
expression vector consisting of the sequence set forth in SEQ ID NO:2. 

[0030] In yet another aspect, the invention features an isolated pcWIN2/MBP expression 
vector comprising the sequence set forth in SEQ ID NO:3. The invention also features an 
15 isolated pcWIN2/MBP expression vector consisting of the sequence set forth in SEQ ID 
NO:3. The invention further features a pcWIN2/MBP expression vector, wherein the 
pCWIN2/MBP vector comprises a protease cleavage site coding sequence adjacent to the 
MBP coding sequence. 

[0031] In another aspect, the invention features an isolated pCWin2-MBP-SBD (pMS 39 ) 
20 expression vector comprising the sequence set forth in SEQ ID NO: 10. The invention also 
features an isolated pCWin2-MBP-SBD (pMS 39 ) expression vector consisting of the 
sequence set forth in SEQ ID NO: 10. 

[0032] In still another aspect, the invention features an isolated pCWin2-MBP-MCS-SBD 
(pMXS 39 ) expression vector comprising the sequence set forth in SEQ ID NO:l 1 . The 
25 invention also features an isolated pCWin2-MBP-MCS-SBD (pMXS 39 ) expression vector 
consisting of the sequence set forth in SEQ ID NO: 1 1 . 

[0033] The invention features a method of expressing a protein from an isolated pcWINl 
expression vector comprising the sequence set forth in SEQ ID NO:l . In another 
embodiment, the invention features a method of expressing a protein from an isolated 
30 pcWIN2 expression vector comprising the sequence set forth in SEQ ID NO:2. In yet 

another embodiment, the invention features a method of expressing a protein from an isolated 
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pcWIN2/MBP expression vector comprising the sequence set forth in SEQ ID NO:3. In still 
another embodiment, the invention features a method of expressing a protein from an isolated 
pCWin2-MBP-SBD (pMS 3 9) expression vector comprising the sequence set forth in SEQ ID 
NO: 10. In another embodiment, the invention features a method of expressing a protein from 
5 an isolated pCWin2-MBP-MCS-SBD (PMXS39) expression vector comprising the sequence 
set forth in SEQ ID NO: 1 1 . In one aspect, the protein is expressed in a prokaryotic cell. 

BRIEF DESCRIPTION OF THE DRAWINGS 
[0034] The foregoing summary, as well as the following detailed description of the 
invention, will be better understood when read in conjunction with the appended drawings. 
1 0 For the purpose of illustrating the invention, there are shown in the drawings embodiment(s) 
which are presently preferred. It should be understood, however, that invention is not limited 
to the precise arrangements and instrumentalities shown. In the drawings: 

[0035] Figure 1 A is an image of an electrophoretic gel containing products of a restriction 
digest. Lanes 1 and 3 are BstEII DNA Marker, lane 2 is SacI/Xbal-digested Cst-04 vector 
15 and lane 4 is Kan r PCR product digested with SacI/XbaL 

[0036] Figure IB is an image of an agar plate showing the result of Cst-04-Kan r 
transformation plated on LB kan r plate. 

[0037] Figure 1C is an image of an electrophoretic gel containing DNA from an E. coli 
colony that screened positive for the Cst-04-Kan r insert. Lane 1 contains BstEII DNA 
20 Marker. Lanes 2-4 contain DNA isolated from the Cst-04-Kan5 colony. Lane 2 contains 
DNA cut with Ndel, lane 3 contains DNA cut with Sail, and lane 4 contains DNA cut with 
Pstl. 

[0038] Figure ID is an image of an ampicillin-containing agar plate and a kanamycin- 
containing agar plate, on both of which Cst-04-Kan5 was streaked. The ampicillin- 
25 containing plate inhibited the growth of Cst-04-Kan5 -containing cells, demonstrating that the 
ampicillin gene in Cst-04-Kan5 -containing cells is inactive, whereas the kanamycin- 
containing plate supports the growth of Cst-04-Kan5-containing cells, demonstrating that the 
kanamycin gene in Cst-04-Kan5-containing cells is operative. 

[0039] Figure 1 E is an image of thin-layer chromatography of the products of the activity 
30 of Cst-04Kan5 plasmid-containing cell lysates using lacto-N-neotetraose as a substrate. 
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Lanes labeled 1 and 2 are Cst-04Kan5 from JM109 cells, lanes labeled 3 are Cst-04Kan5 
isolated from TGI cells, and lanes labeled 4 are Cst-04-6-1. 

[0040] Figure 2A is an image of the agarose gel from which restriction enzyme-digested 
PCR products were isolated. Lanes marked "M" contain 1 kb DNA markers, lane 1 
5 contained pCWINl insert, lane 2 contained Ndel/Scal-digested pCWori Kan r Cst04Kan5 
vector, lane 3 contained pre-pCWIN2 insert, and lane 4 contained BamHI/EcoRI-digested 
pCWori Kan r Cst04Kan5 vector. 

[0041] Figure 2B is an image of an electrophoretic gel, illustrating the results of restriction 
digestion of plasmid DNA isolated from positive transformants as a result of pCWinl and 
10 pre-pCWin2 DNA mini-prep. Lanes labeled "M" contain 1 kb DNA markers. Lanes 1 to 5 
contain pCWinl clones. Lanes 6 to 14 contains pre-pCWin2 clones. All 14 clones were 
digested with EcoRl. 

[0042] Figure 2C is an image of an electrophoretic gel, illustrating the results of restriction 
digestion of plasmid DNA isolated from positive transformants as a result of pCWinl and 
1 5 pre-pCWin2 DNA mini -prep. Lanes labeled "M" contain 1 kb DNA markers. Lane 1 

contains pCWinl clone #5, lane 2 contains pre-pCWin2 clone #11. Both clones # 5 and #1 1 
were digested with Ndel and Seal. 

[0043] Figure 2D is an image of an electrophoretic gel, illustrating the results of restriction 
pCWin2 mini-prep screening. Lanes labeled "M" contains 1 kb DNA markers. Lanes 1 
20 through 18 contain pCWin2 clones. The clones were all digested with PstL 

[0044] Figure 3 A is an image of an electrophoretic gel containing the Ndel/BamHI- 
digested malE cDNA. 

[0045] Figure 3B is an image of an electrophoretic gel containing the restriction enzyme- 
digested pCWin2 vector. 

25 [0046] Figure 3C is an image of two electrophoretic gels containing the restrction enzyme- 
digested pCWin2 vector. This figure represents the screening of colonies to verify that the 
malE Ndel and BamHI insert size was correct. The first lane on each gel contains 1 kb DNA 
molecular weight markers, as indicated in the figure. Lanes 1, 2, 3, 4, 5, 7, 8, 9 and 10 
correspond to colonies selected from the transformation plate and which positively show the 

30 presence of the malE cDNA. Lane 6 corresponds to a colony selected from the 
transformation plate bearing a vector that does not contain the malE insert. 
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[0047] Figures 4A, 4B, and 4C comprise the entire nucleotide sequence of pcWINl , as set 
forth in SEQ ID NO:l. 

[0048] Figures 5 A, 5B 5 and 5C comprise the entire nucleotide sequence of pcWIN2, as set 
forth in SEQ ID NO:2. 

5 [0049] Figures 6A, 6B> 6C, and 6D comprise the entire nucleotide sequence of 
pcWIN2/MBP, as set forth in SEQ ID NO:3. 

[0050] Figure 7A is an image of an electrophoretic gel illustrating the results of a 
restriction enzyme-digested PCR reaction used to create the SBD 39 insert. Lane M is a Ikb 
DNA marker. Lane 1 is the SBD 39 PCR insert product digested with Bgl II and BamHl. The 
10 expected size for the SBD39 insert is 447bp. 

[0051] Figure 7B is an image of an electrophoretic gel illustrating the result of the 
restriction enzyme digestion of pCWin2-MBP-ST3Gal III (A73) vector. Lane M is a lkb 
DNA marker. Lane 1 is pCWin2-MBP-ST3Gal III (A73) digested with BamHl. The 
expected size for the vector is 7 kb. 

1 5 [0052] Figure 7C is an image of an electrophoretic gel illustrating the results of the DNA 
mini-prep enzymatic digestion screen of pCWin2-MBP-SBD-ST3Gal III (A73). Lanes M are 
a lkb DNA marker, lanes 1 through 1 1 are pCWin2MBP-SBD-ST3Gal III A73 construct 
colonies 1 through 1 1 respectively, digested with Ndel and BamHl. The expected size for 
the pCWin2-ST3Gal III A73 vector band is 5.9 kb and the expected size for the MBP-SBD 

20 insert is 1 .6 kb. Clone #6 (Lane 6) illustrates a positive result. 

[0053] Figure 7D is an image of an electrophoretic gel illustrating the restriction enzyme 
digestion of pCWin2 vector with BamHl and Seal. Lane M is a lkb DNA marker. Lane 1 is 
digested pCWin2. The expected size for the vector is 4.3 kb and the expected size for the 
BamH I /Sea I MCS insert is 0.8 kb. 

25 [0054] Figure 7E is an image of an electrophoretic gel illustrating the results of the 

restriction enzyme digestion of pCWin-MBP-SBD 39 -ST3Gal III (A73). Lane M is a lkb DNA 
marker. Lane 1 is pCWin-MBP-SBD 39 -ST3Gal III (A73) digested with BamH I and Sea I. 
The expected size for the vector is Linear is pCWin-MBP-SBD 39 is 5.8 kb and the expected 
size for the BamH I /Sea I ST3Gal III (A73) insert is 1.6 kb. 
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[0055] Figure 7F is an image of an electrophoretic gel illustrating the results of the DNA 
mini-prep restriction enzyme digestion screen of pCWin2-MBP-SBD 39 clones. Lanes M is a 
lkb DNA marker, lanes 1 through 16 are pCWin2-MBP-SBD 39 construct colonies 1 through 
16 respectively, digested with Nde I and Xba I. The expected size for the pCWin2 vector 
5 band is 5.0 kb and the expected size for the MBP-SBD insert is 1 .65 kb. Clone #1 (Lane 1 ) 
illustrates a positive result. 

[0056] Figure 7G is a nucleic acid vector feature map, illustrating restriction sites for the 
pCWin-MBP-SBD 39 (pMS 39 ) construct. 

[0057] Figure 8A is an electrophoretic gel illustrating the results of the PCR reaction to 
10 prepare the SBD insert. Lane M is X BstE II DNA marker, lane 1 is SBD PCR insert product. 
The expected size for the SBD insert is 447bp. 

[0058] Figure 8B is an image of an electrophoretic gel illustrating the results of DNA 
isloated from PCR-Blunt-SBD colonies and subjected to restriction enzyme digestion. Lane 
M is a X BstE II DNA marker, lanes 1 through 8 are PCR-Blunt-SBD colonies 13, 14, 15, 17, 
15 18, 19, 20, and 22 respectively, all digested with Xhol and Sail. The expected size for the 
PCR-Blunt vector band is 3kb and the expected size for the SBD insert is 447bp. 

[0059] Figure 8C is an image of an electrophoretic gel illustrating the results of the 
restriction enzyme-digested pCWin2-MBP kan r vector. Lane M is a lkb DNA ladder, lane 1 
is pCWin2-MBP kan r Vector digested with Xhol and Sal I. The expected size for the 
20 pCWin2-MBP kan r vector band is 3kb. 

[0060] Figure 8D is an image of an electrophoretic gel illustrating the results of DNA 
isolated from pCWin2-MBP-MCS-SBD (pMXS 39 ) vector-containing colonies. Lane M is a X 
BstE II DNA marker, lanes 1 through 13 are pMXS39 colonies 1 through 13 respectively, 
digested with Xhol and Sail. The expected size for the pCWin2-MBP vector band is 6. lkb 
25 and the expected size for the SBD insert is 447bp. Two out of thirteen colonies had the 
correct size of insert and pMXS 39 vector (see lanes 4 and 5 in Figure 8D). 

[0061] Figure 8E is a nucleic acid vector feature map, illustrating restriction sites for the 
pCWin-MBP-MCS-SBD 39 (pMXS 39 ) construct. 

DETAILED DESCRIPTION OF THE INVENTION 
30 [0062] The use of therapeutic proteins to treat patients experiencing disease or illness 

increases yearly. Protein therapeutics typically lack the same problematic side effects found 
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with certain traditional chemical therapeutics. Even in instances where the protein 
therapeutic is altered slightly from its natural state, such a protein typically does not have the 
same side effects as do certain chemical therapeutics. Similar to the increase in the use of 
therapeutic proteins, the use of non-therapeutic, or "reagent" proteins increases exponentially 
5 from year to year. For example, reagent proteins are used in such areas as food biochemistry, 
bioremediation, production of small molecule therapeutics, and even in the production of 
therapeutic proteins. 

[0063] The increasing use of protein reagents and therapeutics has enhanced the need for 
production and preparation of such proteins. It is generally impractical, in terms of cost and 

10 time, to isolate and purify a protein therapeutic from its natural source. The cost of isolating 
proteins from natural sources is prohibitive, and the amount of time needed for such isolation 
techniques is lengthy. For example, a difficult and time-consuming process for isolating a 
therapeutic protein from a natural source will drive up the cost of that reagent or therapeutic, 
which in the latter instance, may unduly burden a medical patient, the patient's insurer, or 

1 5 both. Further, a burdensome isolation process can limit the amount of therapeutic protein 
available to those in need thereof. Finally, a difficult isolation process can also overburden 
the entity that produces the reagent or therapeutic protein, reducing profits and wasting 
valuable business time. 

[0064] In vitro systems have therefore been developed to produce recombinant forms of 
20 reagent proteins and therapeutic proteins. One of the most significant groups of organisms 
used as an in vitro system for production of recombinant therapeutic proteins is bacteria, and 
in particular, Escherichia coli. E. coli is often used for its simplicity, as it is easy to culture 
and to maintain, and more importantly, it is easy to manipulate genetically. Further, it is 
relatively simple to isolate protein expressed from E. coli. 

25 [0065] There are numerous expression vectors that are compatible with bacteria, and in 
particular, with E. coli, for the purpose of producing recombinant therapeutic proteins. 
However, many vectors are useful only under particular circumstances, and therefore have 
drawbacks with respect to their utility for selected protein expression under the specific 
circumstances that may be required. The present invention sets forth methods of providing a 

30 protein to a customer that overcome some of the difficulties associated with commercial 
protein production. 
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[0066] The present invention therefore features a method of providing a protein to a 
customer, wherein the protein of interest is expressed in a vector containing a nucleic acid 
encoding the protein of interest, as well as a multiple cloning site and an antibiotic resistance 
marker, and further wherein the resulting protein is provided to a customer. Part of the 
5 advantage of the present invention is that the expression vectors of the present invention 
reduce the complexity of subcloning the cDNA encoding a therapeutic protein. Further, the 
expression vector of the present invention enables the production of proteins using an 
antibiotic resistance marker other than the ampicillin antibiotic resistance marker, which is 
not approved for Good Manufacturing Practice (GMP) protocols required by the Food and 
1 0 Drug Administration. 

[0067] Part of the advantage of the present invention is due to the flexibility of the 
expression vector used to express the protein. The flexibility of a vector of the present 
invention provides that a protein can be produced rapidly and efficiently. A cDNA encoding 
a protein of interest can be readily subcloned into the expression vector by way of a multiple 
1 5 cloning site. Therefore, a method of the present invention features the use of an expression 
vector as described above to provide a protein to a customer in a more efficient manner. 

[0068] Another advantage of the present invention is that expression vectors of the 
invention offer increased productivity and efficiency of protein expression. That is, the 
design and use of vectors of the present invention provide increased levels of protein 
20 expression and production, leading to increased efficiencies in protein expression over similar 
vectors known in the art. Such advantages increase the quantity of protein produced, and 
therefore also serve to lower the cost of protein production and increase profit through sales 
of protein. 

[0069] The flexibility and functionality of a vector of the present invention increases the 
25 ease, efficiency and reliability of the delivery of a protein to a customer. The use of a method 
of the present invention to streamline and enhance protein product delivery to a customer not 
only increases the production and profitability of a business entity using such methods, but it 
also has the effect of increasing the opportunity for medical patients in need thereof with a 
therapeutic protein. 

30 [0070] The present invention also features vectors for expression of reagent proteins, and 
methods of providing reagent proteins produced using such vectors to a customer. Vectors of 
the invention useful for the expression of reagent proteins may be the same vectors used to 
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produce therapeutic proteins in methods of the invention. Additionally, vectors of the 
invention useful for the expression of reagent proteins may be different vectors than those 
used to produce therapeutic proteins in methods of the invention. 

[0071] Vectors of the invention designed for production of reagent proteins are further 
5 useful for production of proteins that are themselves useful in the subsequent production of 
small chemical therapeutics and for remodeling of non-protein molecules, such as 
carbohydrates. Examples of such proteins include glycosyltransferases, glycosidases and 
enzymes used in the production of sugar nucleotides. Further, such proteins are useful to 
produce and remodel carbohydrate-containing glycoproteins. The production and remodeling 
10 of glycoproteins has significant therapeutic impact, as glycoproteins form the basis of a 
significant number of recombinant therapeutics. 

Definitions 

[0072] Unless defined otherwise, all technical and scientific terms used herein have the 
same meaning as commonly understood by one of ordinary skill in the art to which this 
1 5 invention belongs. Although any methods and materials similar or equivalent to those 

described herein can be used in the practice or testing of the present invention, the preferred 
methods and materials are described herein. 

[0073] As used herein, each of the following terms has the meaning associated with it in 
this section. 

20 [0074] The articles "a n and "an" are used herein to refer to one or to more than one (i.e. to 
at least one) of the grammatical object of the article. By way of example, "an element" 
means one element or more than one element. 

[0075] "Encoding" refers to the inherent property of specific sequences of nucleotides in a 
polynucleotide, such as a gene, a cDNA, or an mRNA, to serve as templates for synthesis of 

25 other polymers and macromolecules in biological processes having either a defined sequence 
of nucleotides (i.e., rRNA, tRNA and mRNA) or a defined sequence of amino acids and the 
biological properties resulting therefrom. Thus, a gene encodes a protein if transcription and 
translation of mRNA corresponding to that gene produces the protein in a cell or other 
biological system. Both the coding strand, the nucleotide sequence of which is identical to 

30 the mRNA sequence and is usually provided in sequence listings, and the non-coding strand, 
used as the template for transcription of a gene or cDNA, can be referred to as encoding the 
protein or other product of that gene or cDNA. 
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[0076] A "coding region" of a gene consists of the nucleotide residues of the coding strand 
of the gene and the nucleotides of the non-coding strand of the gene which are homologous 
with or complementary to, respectively, the coding region of an mRNA molecule which is 
produced by transcription of the gene. 

5 [0077] A "coding region" of an mRNA molecule also consists of the nucleotide residues of 
the mRNA molecule which are matched with an anticodon region of a transfer RNA 
molecule during translation of the mRNA molecule or which encode a stop codon. The 
coding region may thus include nucleotide residues corresponding to amino acid residues 
which are not present in the mature protein encoded by the mRNA molecule (e.g. , amino acid 
10 residues in a protein export signal sequence). 

[0078] An "affinity tag" is a peptide or polypeptide that may be genetically or chemically 
fused to a second polypeptide for the purposes of purification, isolation, targeting, trafficking, 
or identification of the second polypeptide. The "genetic" attachment of an affinity tag to a 
second protein may be effected by cloning a nucleic acid encoding the affinity tag adjacent to 
15 a nucleic acid encoding a second protein in a nucleic acid vector. 

[0079] As used herein, the term "glycosyltransferase," refers to any enzyme/protein that 
has the ability to transfer a donor sugar to an acceptor moiety. 

[0080] A "sugar nucleotide-generating enzyme" is an enzyme that has the ability to 
produce a sugar nucleotide. Sugar nucleotides are known in the art, and include, but are not 
20 limited to, such moieties as UDP-Gal, UDP-GlcNAc, and CMP -NAN. 

[0081] An "isolated nucleic acid" refers to a nucleic acid segment or fragment which has 
been separated from sequences which flank it in a naturally occurring state, e.g., a DNA 
fragment which has been removed from the sequences which are normally adjacent to the 
fragment, e.g., the sequences adjacent to the fragment in a genome in which it naturally 

25 occurs. The term also applies to nucleic acids which have been substantially purified from 
other components which naturally accompany the nucleic acid, e.g., RNA or DNA or 
proteins, which naturally accompany it in the cell. The term therefore includes, for example, 
a recombinant DNA which is incorporated into a vector, into an autonomously replicating 
plasmid or virus, or into the genomic DNA of a prokaryote or eukaryote, or which exists as a 

30 separate molecule (e.g, as a cDNA or a genomic or cDNA fragment produced by PCR or 

restriction enzyme digestion) independent of other sequences. It also includes a recombinant 
DNA which is part of a hybrid gene encoding additional polypeptide sequence. 
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[0082] In the context of the present invention, the following abbreviations for the 
commonly occurring nucleic acid bases are used. "A" refers to adenosine, "C" refers to 
cytidine, "G" refers to guanosine, "T" refers to thymidine, and "U" refers to uridine. 

[0083] A "polynucleotide" means a single strand or parallel and anti-parallel strands of a 
5 nucleic acid. Thus, a polynucleotide may be either a single-stranded or a double-stranded 
nucleic acid. 

[0084] The term "nucleic acid" typically refers to large polynucleotides. 

[0085] The term "oligonucleotide" typically refers to short polynucleotides, generally no 
greater than about 50 nucleotides. It will be understood that when a nucleotide sequence is 
10 represented by a DNA sequence (i.e., A, T, G, C), this also includes an RNA sequence (i.e., 
A, U, G, C) in which "U" replaces "T." 

[0086] Conventional notation is used herein to describe polynucleotide sequences: the left- 
hand end of a single-stranded polynucleotide sequence is the 5 5 end; the left-hand direction of 
a double-stranded polynucleotide sequence is referred to as the S'-direction. 

15 [0087] A first defined nucleic acid sequence is said to be "immediately adjacent to" a 
second defined nucleic acid sequence when, for example, the last nucleotide of the first 
nucleic acid sequence is chemically bonded to the first nucleotide of the second nucleic acid 
sequence through a phosphodiester bond. Conversely, a first defined nucleic acid sequence is 
also said to be "immediately adjacent to" a second defined nucleic acid sequence when, for 

20 example, the first nucleotide of the first nucleic acid sequence is chemically bonded to the 
last nucleotide of the second nucleic acid sequence through a phosphodiester bond. 

[0088] A first defined polypeptide sequence is said to be "immediately adjacent to" a 
second defined polypeptide sequence when, for example, the last amino acid of the first 
polypeptide sequence is chemically bonded to the first amino acid of the second polypeptide 
25 sequence through a peptide bond. Conversely, a first defined polypeptide sequence is said to 
be "immediately adjacent to" a second defined polypeptide sequence when, for example, the 
first amino acid of the first polypeptide sequence is chemically bonded to the last amino acid 
of the second polypeptide sequence through a peptide bond. 

[0089] The direction of 5 1 to 3' addition of nucleotides to nascent RNA transcripts is 
30 referred to as the transcription direction. The DNA strand having the same sequence as an 
mRNA is referred to as the "coding strand"; sequences on the DNA strand which are located 
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5' to a reference point on the DNA are referred to as "upstream sequences"; sequences on the 
DNA strand which are 3 1 to a reference point on the DNA are referred to as "downstream 
sequences." 

[0090] Unless otherwise specified, a "nucleotide sequence encoding an amino acid 
sequence" includes all nucleotide sequences that are degenerate versions of each other and 
that encode the same amino acid sequence. Nucleotide sequences that encode proteins and 
RNA may include introns. 

[0091] "Homologous" as used herein, refers to nucleotide sequence similarity between two 
regions of the same nucleic acid strand or between regions of two different nucleic acid 
strands. When a nucleotide residue position in both regions is occupied by the same 
nucleotide residue, then the regions are homologous at that position. A first region is 
homologous to a second region if at least one nucleotide residue position of each region is 
occupied by the same residue. Homology between two regions is expressed in terms of the 
proportion of nucleotide residue positions of the two regions that are occupied by the same 
nucleotide residue. By way of example, a region having the nucleotide sequence 5'- 
ATTGCC-3' and a region having the nucleotide sequence 5 f -TATGGC-3' share 50% 
homology. Preferably, the first region comprises a first portion and the second region 
comprises a second portion, whereby, at least about 50%, and preferably at least about 75%, 
at least about 90%, or at least about 95% of the nucleotide residue positions of each of the 
portions are occupied by the same nucleotide residue. More preferably, all nucleotide residue 
positions of each of the portions are occupied by the same nucleotide residue. 

[0092] As used herein, "homology" is used synonymously with "identity." The 
determination of percent identity between two nucleotide or amino acid sequences can be 
accomplished using a mathematical algorithm. For example, a mathematical algorithm useful 
for comparing two sequences is the algorithm of Karlin and Altschul (1990, Proc. Natl. Acad. 
Sci. USA 87:2264-2268), modified as in Karlin and Altschul (1993, Proc. Natl. Acad. Sci. 
USA 90:5873-5877). This algorithm is incorporated into the NBLAST and XBLAST 
programs of Altschul et al. (1990, J. Mol. Biol. 215:403-410), and can be accessed, for 
example, at the BLAST site of the National Center for Biotechnology Information (NCBI) 
world wide web site at the National Library of Medicine (NLM) at the National Institutes of 
Health (NIH). BLAST nucleotide searches can be performed with the NBLAST program 
(designated "blastn" at the NCBI web site), using the following parameters: gap penalty = 5; 
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gap extension penalty = 2; mismatch penalty = 3; match reward = 1 ; expectation value 10.0; 
and word size = 1 1 to obtain nucleotide sequences homologous to a nucleic acid described 
herein. BLAST protein searches can be performed with the XBLAST program (designated 
"blastn" at the NCBI web site) or the NCBI "blastp" program, using the following 
parameters: expectation value 10.0, BLOSUM62 scoring matrix to obtain amino acid 
sequences homologous to a protein molecule described herein. 

[0093] To obtain gapped alignments for comparison purposes, Gapped BLAST can be 
utilized as described in Altschul et al. (1997, Nucleic Acids Res. 25:3389-3402). 
Alternatively, PSI-Blast or PHI-Blast can be used to perform an iterated search which detects 
distant relationships between molecules (id.) and relationships between molecules which 
share a common pattern. When utilizing BLAST, Gapped BLAST, PSI-Blast, and PHI-Blast 
programs, the default parameters of the respective programs (e.g., XBLAST and NBLAST) 
can be used as available on the website of the National Center for Biotechnology Information 
of the National Library of Medicine at the National Institutes of Health. 

[0094] The percent identity between two sequences can be determined using techniques 
similar to those described above, with or without allowing gaps. In calculating percent 
identity, typically exact matches are counted. 

[0095] "Polypeptide" refers to a polymer composed of amino acid residues, related 
naturally occurring structural variants, and synthetic non-naturally occurring analogs thereof 
linked via peptide bonds, related naturally occurring structural variants, and synthetic non- 
naturally occurring analogs thereof. Synthetic polypeptides can be synthesized, for example, 
using an automated polypeptide synthesizer. 

[0096] As used herein, amino acids are represented by the full name thereof, by the three 
letter code corresponding thereto, or by the one-letter code corresponding thereto, as 



indicated in the following table: 

Full Name Three- Letter Code One-Letter Code 

Aspartic Acid Asp D 

Glutamic Acid Glu E 

Lysine Lys K 

Arginine Arg R 

Histidine His H 

Tyrosine Tyr Y 
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1 5 [0097] The term "protein" typically refers to large polypeptides. 
[0098] The term "peptide" typically refers to short polypeptides. 

[0099] Conventional notation is used herein to portray polypeptide sequences: the left-hand 
end of a polypeptide sequence is the amino-terminus; the right-hand end of a polypeptide 
sequence is the carboxyl-terminus. 

20 [0100] A "therapeutic protein" as the term is used herein refers to any protein that is useful 
to treat a disease state or to improve the overall health of a living organism. A therapeutic 
protein may effect such changes in a living organism when administered alone, or when used 
to improve the therapeutic capacity of another substance. 

[0101] A "reagent protein" as the term is used herein refers to any protein that is useful in 
25 food biochemistry, bioremediation, production of small molecule therapeutics, and even in 
the production of therapeutic proteins. Typically, reagent proteins are enzymes capable of 
catalyzing a reaction to produce a product useful in any of the aforementioned areas. 

[0102] A "vector" is a composition of matter which comprises an isolated nucleic acid and 
which can be used to deliver the isolated nucleic acid to the interior of a cell. Numerous 
30 vectors are known in the art including, but not limited to, linear polynucleotides, 

polynucleotides associated with ionic or amphiphilic compounds, plasmids, and viruses. 
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Thus, the term "vector" includes an autonomously replicating plasmid or a virus. The term 
should also be construed to include non-plasmid and non-viral compounds which facilitate 
transfer of nucleic acid into cells, such as, for example, polylysine compounds, liposomes, 
and the like. Examples of viral vectors include, but are not limited to, adenoviral vectors, 
5 adeno-associated virus vectors, retroviral vectors, and the like. 

[0103] "Expression vector" refers to a vector comprising a recombinant polynucleotide 
comprising expression control sequences operatively linked to a nucleotide sequence to be 
expressed. An expression vector comprises sufficient cis-acting elements for expression; 
other elements for expression can be supplied by the host cell or in an in vitro expression 
10 system. Expression vectors include all those known in the art, such as cosmids, plasmids 
(e.g., naked or contained in liposomes) and viruses that incorporate the recombinant 
polynucleotide. 

[0104] A "multiple cloning site" as the term is used herein is a region of a nucleic acid 
vector that contains more than one sequence of nucleotides that is recognized by at least one 
1 5 restriction enzyme. 

[0105] An "antibiotic resistance marker" as the term is used herein refers to a sequence of 
nucleotides that encodes a protein which, when expressed in a living cell, confers to that cell 
the ability to live and grow in the presence of an antibiotic. 

[0106] The term "saccharide" refers in general to any carbohydrate, a chemical entity with 
20 the most basic structure of (CH 2 0) n . Saccharides vary in complexity, and may also include 
nucleic acid, amino acid, or virtually any other chemical moiety existing in biological 
systems. 

[0107] "Monosaccharide" refers to a single unit of carbohydrate of a defined identity. 

[0108] "Oligosaccharide" refers to a molecule consisting of several units of carbohydrates 
25 of defined identity. Typically, saccharide sequences between 2-20 units may be referred to as 
oligosaccharides. 

[0109] "Polysaccharide" refers to a molecule consisting of many units of carbohydrates of 
defined identity. However, any saccharide of two or more units may correctly be considered 
a polysaccharide. 



19 



[0110] A "party 55 as the term is used herein refers to an individual or an entity involved in a 
transaction related to a method, a vector, or a protein of the present invention. For example, 
an individual who provides a vector to a business entity is considered to be a "party 5 ' in the 
context of the present invention. Further, the business entity is also considered to be a 
5 "party 55 in the context of the present invention. 

[0111] A "recipient 55 as the term is used herein refers to a specific party who receives a 
vector or a protein of the present invention. For example, if an individual gives a business 
entity a vector of the invention, the business entity is considered to be a "recipient 55 in the 
context of the present invention. By way of another example, an individual within an 
1 0 organization may provide a second individual within the same organization with a vector or a 
protein of the present invention. The second individual, who is in receipt of a vector or a 
protein of the invention, is considered to be a "recipient 55 in terms of the present invention. It 
should be noted that a recipient may be a customer, but that not all customers are recipients. 

[0112] A "customer, 55 as the term is used herein, refers to an intended recipient of a specific 
15 item in a formal transaction. A customer is also a recipient, but is distinct from a recipient in 
that a customer is recipient who is an endpoint for a transaction, whereas a recipient may be 
an intermediate in a larger transaction. For example, customers of the present invention 
include, but are not limited to, an entity responsible for the creation of an expression vector 
that contains the cDNA encoding a protein of interest, if that entity will use the protein 
20 produced. A customer is also an entity responsible for expression of a protein from a vector 
that contains the cDNA encoding a protein of interest, if that entity will use the protein 
produced. A customer also may be an entity that purchases a protein expressed from a vector 
of the present invention, for the purpose of using the protein. Further, the entity that creates 
an expression vector of the present invention may be a customer if that entity uses the protein 
25 produced by the vector. 

[0113] A "protein production facility 55 as the term is used herein is any location that has the 
ability to express a protein encoded within a nucleic acid vector. 

[0114] As the term is used herein, "in-house 55 refers to dealings within a single 
organization. In this context, an organization may be a single company, two or more jointly 
30 cooperating laboratories, or a corporation and its subsidiaries, collectively. 
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[0115] "Offsite," as the term is used herein, refers to dealings that extend beyond the "in- 
house" context.discussed above. For example, the transfer of a vector of the invention from a 
first organization to a second organization is a transfer of that vector "offsite." 

L Vectors 

5 [0116] The present invention includes an isolated nucleic acid encoding a protein operably 
linked to a nucleic acid comprising a promoter/regulatory sequence such that the nucleic acid 
is capable of directing expression of the protein encoded by the nucleic acid. Thus, the 
invention encompasses expression vectors and methods for the expression of proteins based 
on exogenous DNA introduced into cells with concomitant expression of the exogenous DNA 
10 in the cells such as those described, for example, in Sambrook et al. (1989, supra), and 
Ausubel et al. (1997, supra). 

[0117] An expression vector of the present invention is based on the pcWori+ vector 
(Muchmore et al., 1987, Meth. Enzymol. 177:44-73). However, the pcWori+ vector by itself 
is not adequate for the production of protein reagents and therapeutic proteins according to 

1 5 the present invention. The pcWori+ vector contains an ampicillin resistance marker. Certain 
regulatory agencies require that the production of proteins for therapeutic use cannot be 
carried out using recombinant vectors containing ampicillin resistance genes. Therefore, an 
expression vector of the present invention features an antibiotic resistance marker approved 
by the U.S. Food and Drug Administration (FDA) for use in the production of protein 

20 reagents and therapeutic proteins. Such antibiotic resistance markers include, but are not 
limited to, kanamycin, tetracycline, and chloramphenicol. 

[0118] In the invention, the ampicillin resistance marker normally present in the pcWori+ 
vector is disabled as follows. Briefly, the ampicillin resistance marker in the pcWoriH- vector 
is disrupted in order to produce, in part, a vector of the present invention. PCR primers 
25 designed to create Pvul and Seal restriction enzyme cleavage sites on either end of a 

kanamycin resistance gene are used, and the resultant PCR product is digested with Pvul and 
Seal restriction enzymes. The ampicillin resistance gene in a pcWoriH- vector is also cut with 
Pvul and Seal restriction enzymes. Subsequently, the kanamycin resistance gene is ligated 
into the pcWori+ vector that was cleaved within the ampicillin resistance gene. 

30 [0119] Verification of successful disruption of the ampicillin resistance gene and successful 
insertion of the kanamycin resistance gene is observed by transforming E. coli cells, for 
example, with the ligation mixture. Growth of the transformed cells on kanamycin- 
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containing agar plates confirms the successful insertion of the kanamycin resistance marker, 
while lack of growth on ampicillin containing plates confirms the successful disruption of the 
ampicillin resistance gene. Other methods of disruption or deletion of the ampicillin 
resistance gene will be known to one of skill in the art. Similarly, other methods of inserting 
5 the kanamycin resistance gene, or any other antibiotic resistance gene useful in the present 
invention, and methods of confirming the insertion and/or deletion of genes will also be 
known to one of skill in the art. 

[0120] Another feature of a vector of the present invention is a versatile and highly- 
functional multiple cloning site. As described in, for example, in Sambrook et al. (1989, 

10 supra), a "multiple cloning site" is a nucleic acid having a sequence encoding more than one 
restriction enzyme recognition site. The practical purpose of a multiple cloning site is to 
allow the ligation (i.e., "insertion") of an exogenous polynucleotide into the multiple cloning 
site, wherein the exogenous polynucleotide may have different restriction enzyme recognition 
sequences at its 5' and 3' ends. That is, the multiple cloning site allows flexibility with 

15 respect to the identity of the 5' and 3' ends on an exogenous polynucleotide, thus facilitating 
the cloning of such a polynucleotide into the multiple cloning site. 

[0121] A multiple cloning site is most often found, and is most useful, in a nucleic acid 
vector such as a vector of the present invention. As will be known to the skilled artisan, a 
multiple cloning site may be located adjacent to other functional elements in a vector, such as 
20 a promoter. A multiple cloning site may also be designed such that insertion of an exogenous 
polynucleotide into the multiple cloning site results in the exogenous polynucleotide being 
expressed in frame with the adjacent elements to create a fusion protein of the protein 
encoded by the exogenous polynucleotide and the protein encoded by the adjacent element. 

[0122] Accordingly, a vector of the present invention contains at least one multiple cloning 
25 site. The creation of a functional multiple cloning site in a vector of the present invention is 
described in greater detail elsewhere herein. Briefly, a multiple cloning site may be designed 
and synthesized de novo, or it may be isolated from another pre-existing vector. PCR 
methods are used to create multiple cloning site polynucleotides having specific restriction 
enzyme recognition sites on either end (5 5 and 3') of the multiple cloning site polynucleotide. 
30 A multiple cloning site polynucleotide is then inserted into a pcWori-i- vector of the present 
invention by means of specific restriction enzyme recognition sites corresponding to those on 
either end of the multiple cloning site polynucleotide. As will be understood by one skilled 

22 



in the art, various molecular biological techniques are available to insert, delete, and/or 
modify a multiple cloning site in a vector of the present invention in order to create a more 
functional and flexible multiple cloning site useful in connection with the present invention. 

[0123] Another feature of a vector of the present invention is the option of an affinity tag 
5 coding sequence located in the multiple cloning site. An affinity tag coding sequence may be 
inserted into the multiple cloning site adjacent to, upstream from, or downstream from a 
target protein coding sequence. As will be understood by one of skill in the art, an affinity 
tag will typically be inserted into the multiple cloning site in frame with the target protein. 
One of skill in the art will also understand that an affinity tag coding sequence can be used to 
10 produce a recombinant fusion protein by concomitantly expressing the affinity tag and target 
protein. The expressed fusion protein can then be isolated, purified, or identified by means of 
the affinity tag. An affinity tag is especially important when expressing proteins that are 
reagents and less important when expressing therapeutic proteins due to restrictions imposed 
by regulatory agencies. 

1 5 [0124] Affinity tags useful in the present invention include, but are not limited to, a maltose 
binding protein, a histidine tag, a Factor IX tag, a glutathione- S -transferase tag, a FLAG-tag, 
and a starch binding domain tag. Other tags are well known in the art, and the use of such 
tags in the present invention would be readily understood by the skilled artisan. 

[0125] Any single vector of the present invention may have more than one feature 
20 described herein. By way of a non-limiting example, a vector of the present invention may 
have a disrupted ampicillin resistance gene, a functional kanamycin resistance gene, and a 
modified, multi-functional multiple cloning site. An example of one such vector of the 
present invention is pCWinl, the sequence of which is set forth in SEQ ID NO:l. A pCWinl 
vector of the present invention has, for example, two BamHI restriction enzyme recognition 
25 sites, one of which is located within the multiple cloning site. Another example of a vector of 
the present invention is pCWin2, the sequence of which is set forth in SEQ ID NO:2. A 
pCWin2 vector of the present invention has, for example, only one BamHI restriction enzyme 
recognition site which is located within the multiple cloning site. 

[0126] A further example of a vector of the present invention is pCWin2-MBP, the 
30 sequence of which is set forth in SEQ ID NO:3. A pCWin2-MBP vector of the invention has, 
for example, one BamHI restriction enzyme recognition site located within the multiple 
cloning site and additionally has an E. coli malE maltose binding protein coding sequence 
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inserted into the multiple cloning site in between the Ndel and BamHI restriction enzyme 
recognition sites. The Ndel sequence in the multiple cloning site contains an ATG start 
codon. The pCWin2-MBP vector is therefore useful, for example, for expression of a fusion 
protein comprised of a maltose binding protein and a desired protein. This is achieved by 
5 inserting a polynucleotide encoding the desired protein into the multiple cloning site in frame 
with the maltose binding protein and expressing the entire open reading frame encoded in the 
multiple cloning site. 

[0127] Yet another example of a vector of the present invention is pCWin2-MBP-SBD 39 
(pMS 39 ), the sequence of which is set forth in SEQ ID NO: 10. A pCWin2-MBP-SBD 39 

1 0 (pMS 3 9) vector of the invention has, for example, one BamHI restriction enzyme recognition 
site located within the multiple cloning site, and one EcoRI restriction enzyme recognition 
site located within the multiple cloning site, and additionally has an E. coli malE maltose 
binding protein coding sequence inserted into the multiple cloning site in between the Ndel 
and Sad restriction enzyme recognition sites. The pCWin2-MBP-SBD 39 (pMS 39 ) vector also 

15 has a starch-binding domain (SBD) inserted between the EcoRI and BamHI restriction sites. 
The Ndel sequence in the multiple cloning site contains an ATG start codon. The pCWin2- 
MBP-SBD 39 (pMS 39 ) vector is therefore useful, for example, for expression of a fusion 
protein comprised of a maltose binding protein, a starch binding domain, and a desired 
protein. This is achieved by inserting a polynucleotide encoding the desired protein into the 

20 multiple cloning site in frame with the maltose binding protein and starch binding domain, 
and expressing the entire open reading frame encoded in the multiple cloning site. 

[0128] Still another example of a vector of the present invention is pCWin2-MBP-MCS- 
SBD 39 (pMXS 39 ), the sequence of which is set forth in SEQ ID NO: 1 1 . As compared to, the 
pMXS 39 vector expresses, in one aspect of the invention, a fusion protein with the structure 

25 "MBP — desired protein — SBD," whereas the pMS 39 vector expresses, in another aspect of 
the invention, a "MBP — SBD — desired protein" fusion protein. Accordingly, a pCWin2- 
MBP-SBD 39 (pMXS 39 ) vector of the invention has, for example, one Xhol restriction enzyme 
recognition site located within the multiple cloning site, and one Sail restriction enzyme 
recognition site located within the multiple cloning site, and additionally has an E. coli malE 

30 maltose binding protein coding sequence inserted into the multiple cloning site in between 

the Ndel and Sad restriction enzyme recognition sites. The pCWin2-MBP-SBD 39 (pMXS 39 ) 
vector also has a starch-binding domain (SBD) inserted between the Xhol and Sail restriction 
sites. The Ndel sequence in the multiple cloning site contains an ATG start codon. The 
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pCWin2-MBP-SBD 3 9 (PMXS39) vector is therefore useful, for example, for expression of a 
fusion protein comprised of a maltose binding protein, a starch binding domain, and a desired 
protein. This is achieved by inserting a polynucleotide encoding the desired protein into the 
multiple cloning site in frame with the maltose binding protein and starch binding domain, 
5 and expressing the entire open reading frame encoded in the multiple cloning site. 

[0129] A vector of the present invention, as described above, is useful for the production of 
a therapeutic protein. A polynucleotide sequence encoding a therapeutic protein may be 
inserted into the multiple cloning site using any technique known to the skilled artisan. For 
example, a polynucleotide sequence encoding a therapeutic protein may be modified to 
10 contain specific restriction enzyme recognition sites at the 5' and 3' ends of the 

polynucleotide. Such restriction enzyme recognition sites will correspond to recognition sites 
located within the multiple cloning site of a vector of the present invention, facilitating the 
insertion (by ligation) of the therapeutic protein-encoding sequence into the multiple cloning 
site, and when expressed, producing a therapeutic protein. 

1 5 [0130] Therapeutic proteins useful in the present invention are numerous and are well- 
known in the art, and are therefore not listed here. By way of a non-limiting example, such 
therapeutic proteins include erythropoietin, human growth hormone, granulocyte colony 
stimulating factor, interferons alpha, -beta, and -gamma, Factor IX, follicle stimulating 
hormone, interleukin-2, erythropoietin, anti-TNF-alpha, and a lysosomal hydrolase. 

20 Lysosomal hydrolases useful in the present invention include, but are not limited to, beta- 
glucosidase, alpha-galactosidase-A, beta-hexosaminidase, beta-galactosidase, alpha- 
galactosidase, alpha-mannosidase, beta-mannosidase, alpha-L-fucosidase, beta- 
glucuronidase, alpha- glucosidase, alpha-N-acetylgalactosaminidase, and acid phosphatase. It 
will be understood that any mutant or variant of a therapeutic protein may be expressed using 

25 vectors of the present invention. 

[0131] The present invention also features a vector useful for the production of a non- 
therapeutic protein, referred to herein as reagent proteins. As will be understood by the 
skilled artisan, a reagent protein is one which does not currently have a therapeutic 
application. Such proteins include, but are not limited to, enzyme reagents, food enzymes, 
30 nutritional supplements, and non-active additives. Methods of expressing reagent proteins 
using vectors of the invention will be understood by the skilled artisan to be conducted in the 
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same manner as the above-described methods of expressing therapeutic proteins using 
vectors of the present invention. 

[0132] Another feature of a vector of the present invention is the option of a protease 
cleavage site coding sequence located in the multiple cloning site. A protease cleavage site 
5 coding sequence may be inserted into the multiple cloning site adjacent to, upstream from, or 
downstream from a target protein coding sequence. As will be understood by one of skill in 
the art, a protease cleavage site will typically be inserted into the multiple cloning site in 
frame with the target protein. One of skill in the art will also understand that a protease 
cleavage site coding sequence can be used to produce a recombinant fusion protein by 
10 concomitantly expressing the protease cleavage site sequence and target protein. The 

expressed fusion protein can then be isolated, purified, or identified by means of the protease 
cleavage site. 

[0133] In an embodiment of the invention, a vector contains a coding sequence for a 
protease cleavage site which is located C-terminal to the nucleic acid sequence encoding an 
15 MBP. In one aspect, a vector is pCWin2-MBP-SBD 39 (pMS 39 ). In another aspect, a vector is 
pCWin2-MBP-MCS-SBD 39 (pMXS 39 ). 

[0134] A fusion protein containing a preselected protease cleavage site, as will be 
understood by one of skill in the art, is useful for the removal of amino acid sequence that is 
extraneous or non-essential to the expressed protein of interest. For example, a target protein 

20 may be expressed using a vector of the present invention as a fusion with an affinity tag for 
the purpose of purification of the target protein, but the affinity tag may not be desirable once 
the protein is sufficiently purified. The insertion of a specific protease cleavage site between 
the target protein and the affinity tag is useful for the cleavage of the affinity tag from the 
target protein. Protease cleavage sites useful in the present invention include, but are not 

25 limited to, an enterokinase cleavage site, a Factor Xa cleavage site, a thrombin cleavage site, 
and a TEV protease cleavage site. The skilled artisan will understand the characteristics and 
uses of a protease cleavage useful in the present invention. 

[0135] The present invention also features a recombinant bacterial host cell comprising , 
inter alia, a nucleic acid vector as described elsewhere herein. In one aspect, the recombinant 
30 cell is transformed with a vector of the present invention. The transformed vector need not be 
integrated into the cell genome nor does it need to be expressed in the cell. However, the 
transformed vector will be capable of being expressed in the cell. In one aspect of the 
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invention, E. coli is used for transformation of a vector of the present invention and 
expression of protein therefrom. In another aspect of the invention, a K-12 strain of E. coli is 
useful for expression of protein from a vector of the present invention. Strains of E. coli 
useful in the present invention include, but are not limited to, JM83, JM101, JM103, JM109, 
5 W3 1 1 0, chi 1 776, and JA22 1 . 

[0136] It will be understood that a host cell useful in the present invention will be capable 
of growth and culture on a small scale, medium scale, or a large scale. For example, a host 
cell of the invention is useful for testing the expression of a protein from a vector of the 
invention equally as much as it is useful for large scale production of a reagent or therapeutic 
1 0 protein product. Techniques useful in culturing host cells and expressing protein from a 
vector contained therein are well known in the art and will therefore not be listed herein. 

[0137] A host cell of the present invention may be transformed with a vector of the present 
invention to produce a transformed host cell of the invention. Transformation, as known to 
the skilled artisan, includes the process of inserting a nucleic acid vector into a host cell, such 
1 5 that the host cell containing the nucleic acid vector remains viable. Such transformation of 
nucleic acid into a bacterial cell is useful for purposes including, but not limited to, creation 
of a stably-transformed host cell, making a biological deposit, propagating the vector- 
containing host cell, propagating the vector-containing host cell for the production and 
isolation of additional vector, expression of target protein encoded by vector, and the like. 

20 [0138] Methods of transforming a vector are numerous and well-known in the art, and will 
therefore not be listed here. By way of a non-limiting example, a competent bacterial cell of 
the invention may be transformed by a vector of the invention using electroporation. 
Methods of making bacterial cells "competent" are well-known in the art, and typically 
involve preparation of the bacterial cells so that the cells take up exogenous DNA. Similarly, 

25 methods of electroporation are known in the art, and detailed descriptions of such methods 
may be found, for example, in Sambrook et al. (1989, supra). The transformation of a 
competent cell with vector DNA may be also accomplished using chemical-based methods. 
One example of a well-known chemical -based method of bacterial transformation is 
described by Inoue, et al. (1990, Gene 96:23-28). Other methods of transformation will be 

30 known to the skilled artisan. 

[0139] In one embodiment of the present invention, a Cst-04Kan5 plasmid is transfomed 
into E. coli JM109 cells using 20 jliI JM109 competent cells in 0.34 \i\ 1.42 M beta 
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mercaptoethanol, incubated on ice for 10 minutes, at which time ljil (100 ng) Cst-04-Kan5 
plasmid is added to the transformation mixture. The cell/DNA mixture is incubated ice for 
30 minutes, then heat shocked at 42 °C for 45 seconds. The reaction is then incubated on ice 
for 2 minutes, at which time 80 jal SOC media is added. The reaction mixture is then shaken 
5 at 37°C for 1 hour, and subsequently, plated on LB Kan r agar plates. Identification and 
confirmation of the Cst-04-Kan5 plasmid DNA is carried out using a restriction enzyme 
digestion of plasmid DNA isolated from positive transformants, using Ndel, Sail, PstI 
restriction enzymes. 

[0140] A transformed host cell of the present invention may be used to express a protein. 
10 In an embodiment of the invention, a transformed host cell contains a vector of the invention, 
which contains therein a nucleic acid sequence encoding an exogenous protein. The protein 
is expressed using any expression method known in the art (for example, IPTG). The 
expressed protein may be contained within the host cell, or it may be secreted from the host 
cell into the growth medium. 

1 5 [0141] Methods for isolating an expressed protein are well-known in the art, and the skilled 
artisan will know how to determine the best method for isolation of an expressed protein 
based on the characteristics of any given host cell expression system. By way of a non- 
limiting example, an expressed protein that is secreted from a host cell may be isolated from 
the growth medium. Isolation of a protein from a growth medium may include removal of 

20 bacterial cells and cellular debris. By way of another non-limiting example, an expressed 
protein that is contained within a host cell may be isolated from the host cell. Isolation of 
such an "intracellular" expressed protein may include disruption of the host cell and removal 
of cellular debris from the resultant mixture. These methods are not intended to be exclusive 
representations of the present invention, but rather, are merely for the purposes of illustration 

25 of various applications of the present invention. 

[0142] Purification of a protein expressed in accordance with the present invention may be 
effected by any means known in the art. The skilled artisan will know how to determine the 
best method for the purification of a protein expressed in accordance with the present 
invention. A purification method will be chosen by the skilled artisan based on factors such 
30 as, but not limited to, the expression host, the contents of the crude extract of the protein, the 
size of the protein, the properties of the protein, the desired end product of the protein 
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purification process, and the subsequent use of the end product of the protein purification 
process. 

[0143] In an embodiment of the invention, isolation or purification of a protein expressed 
in accordance with the present invention may not be desired. In an aspect of the present 
5 invention, an expressed protein may be stored or transported inside the bacterial host cell in 
which the protein was expressed. In another aspect of the invention, an expressed protein 
may be used in a crude lysate form, which is produced by lysis of a host cell in which the 
protein was expressed. In yet another embodiment of the invention, an expressed protein 
may be partially isolated or partially purified according to any of the methods set forth or 
1 0 described herein. The skilled artisan will know when it is not desirable to isolate or purify a 
protein of the invention, and will be familiar with the techniques available for the use and 
preparation of such proteins. 

II. Methods of providing a protein to a customer 

[0144] The present invention features a method of providing a protein to a customer. In an 
15 embodiment of the invention, a nucleic acid encoding a protein is cloned into an expression 
vector. The encoded protein is expressed from the expression vector, and the resulting 
protein product is provided to a customer. 

[0145] In an embodiment of the invention, a protein is expressed from an expression vector 
in vitro. Techniques for in vitro protein expression are known in the art, and are exemplified 
20 by the methods of Melton and colleagues (Krieg et al., 1987, Meth. Enzymol. 155, 397-415; 
Yisraeli et al., 1989, Meth. Enzymol. 180, 42-50). A protein produced using an in vitro 
expression method of the present invention is provided to a customer. 

[0146] In another embodiment of the invention, a protein is expressed from an expression 
vector in vivo. Numerous techniques for expression of a protein from an expression vector in 
25 vivo are described in detail elsewhere herein, and are also well-known in the art. Such 

techniques include, but are not limited to, expression of a protein from a vector in a bacterial 
host cell. A protein produced using an in vivo expression method of the present invention is 
provided to a customer. 

[0147] In one aspect of the invention, a protein is expressed from a pcWINl expression 
30 vector, as set forth in SEQ ID NO: 1 . In another aspect of the invention, a protein is 

expressed from a pcWIN2 expression vector, as set forth in SEQ ID NO:2. In yet another 
aspect of the invention, a protein is expressed from a pcWIN2/MBP expression vector, as set 
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forth in SEQ ID NO:3. In still another aspect of the invention, a protein is expressed from a 
pCWin2-MBP-SBD 39 (pMS 39 ) expression vector, as set forth in SEQ ID NO: 10. In yet 
another aspect of the invention, a protein is expressed from a pCWin2-MBP-MCS-SBD 39 
(pMXS 39 ) expression vector, as set forth in SEQ ID NO: 1 1 . As will be understood by one of 
5 skill in the art, a pcWIN vector of the present invention is useful in any of the expression 
methods set forth herein for the production of a target protein that may be provided to a 
customer. 

[0148] Methods of the present invention for in vivo expression of a protein in a bacterial 
cell comprise transformation of the bacterial cell with an expression vector comprising the 
10 protein of interest. Methods of transforming a bacterial cell with a vector are described in 

detail elsewhere herein, and would be understood by one of ordinary skill in the art. It will be 
appreciated that methods of bacterial cell transformation other than those explicitly disclosed 
herein are useful in methods of the present invention, and therefore, are within the scope of 
the present invention. 

15 [0149] Vectors featured in methods of the present invention are described in detail 

elsewhere herein. In an embodiment of the invention, a method of providing a protein to a 
customer comprises expressing a protein from an expression vector useful for production of a 
therapeutic protein. As described above, vectors of the invention useful in such methods are 
comprised of an antibiotic resistance marker such as kanamycin, tetracycline, 

20 chloramphenicol, and the like, as such antibiotics are particularly useful in connection with 
the expression of therapeutic proteins. A therapeutic protein provided to a customer using 
this method is particularly useful to the customer due to the fact that kanamycin, tetracycline, 
chloramphenicol and like antibiotics are preferred by certain regulatory agencies for the 
production of therapeutic proteins. 

25 [0150] The present invention therefore features a method of providing a known therapeutic 
protein to a customer. Therapeutic proteins include, but are not limited to, human growth 
hormone, granulocyte colony stimulating factor, interferons alpha, -beta, and -gamma, Factor 
IX, follicle stimulating hormone, beta-glucosidase, interleukin-2, erythropoietin, alpha- 
galactosidase-A, and anti-TNF-alpha. It will be understood by the skilled artisan that any 

30 nucleic acid encoding a therapeutic protein, wherein the nucleic acid is capable of being 
cloned into and expressed from a nucleic acid vector of the invention, will be useful in the 
present invention. The ability to determine a therapeutic protein useful in the present 
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invention is within the skill of the ordinary artisan and such a determination does not require 
undue experimentation. 

[0151] In another embodiment of the invention, a method of providing a protein to a 
customer comprises expressing a protein from an expression vector useful for production of a 
5 reagent protein. Vectors of the invention useful in such methods preferably have had the 
native ampicillin resistance gene disabled, altered, or deleted, such that the ampicillin 
resistance gene is no longer functional in the vector. Such vectors are comprised of any 
antibiotic resistance marker other than ampicillin known in the art to be useful in the 
expression of proteins. Antibiotic resistance markers useful in the invention include 
10 kanamycin, tetracycline, chloramphenicol, and like antibiotic resistance markers approved by 
certain regulatory agencies, as well as any antibiotic resistance marker not approved by 
certain regulatory agencies for use with therapeutic proteins. 

[0152] Therefore, the present invention also features methods of providing a protein to a 
customer, wherein the protein is a reagent protein, and therefore, need not be expressed from 

1 5 a vector containing antibiotic resistance marker accepted by a regulatory agency. However, a 
reagent protein may also be expressed from a vector containing an FDA-accepted antibiotic 
resistance marker. A protein produced by such a method of the invention may be useful for 
almost any purpose, including, but not limited to, an enzyme reagent, a food enzyme, a 
nutritional supplement, and a non-active additive. Examples of such proteins include, but are 

20 not limited to a glycosyltransferase and a sugar nucleotide-generating enzyme. 

[0153] In an embodiment of the invention, a method is provided wherein a nucleic acid is 
cloned into a vector containing any antibiotic resistance marker useful in the expression of a 
protein, the protein is expressed therefrom, and the resulting protein product is provided to a 
customer. It will be understood that such a protein may be expressed in vivo or in vitro. 

25 [0154] Methods of the present invention also feature a vector comprising a highly- 
functional multiple cloning site. Such vectors are described in detail elsewhere herein. In an 
embodiment of the invention, a method of providing a protein to a customer comprises 
expressing a protein from an expression vector useful for production of a therapeutic protein. 
As described above, vectors of the invention useful in such methods are comprised of a 

30 highly-functional multiple cloning site, in addition to an antibiotic resistance marker such as 
kanamycin, tetracycline, chloramphenicol, and the like. In this embodiment, a therapeutic 
protein is expressed cloned into a vector comprising a highly- functional multiple cloning site 
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in addition to an antibiotic resistance marker, expressed therefrom, and provided to a client. 
In one aspect of the invention, the multiple cloning site contains at least one of Ndel, BamHI, 
Sad, Hindlll, Xbal, Xhol, EcoRI, Kpnl, and Sail restriction enzyme cleavage sites. 

[0155] Methods of the present invention also feature a vector comprising an affinity tag. 
5 Such vectors are described in detail elsewhere herein. In an embodiment of the invention, a 
method of providing a protein to a customer comprises expressing a protein from an 
expression vector useful for production of a therapeutic protein. As described above, vectors 
of the invention useful in such methods are comprised of an affinity tag, in addition to an 
antibiotic resistance marker such as kanamycin, tetracycline, chloramphenicol, and the like. 
1 0 In this embodiment, a therapeutic protein is expressed cloned into a vector comprising an 

affinity tag in addition to an antibiotic resistance marker, expressed therefrom, and provided 
to a client. In a preferred embodiment, the affinity tag is a maltose-binding protein. 

[0156] In other embodiments of the invention, a useful affinity tag may be, but is not 
limited to, a histidine tag, a Factor IX tag, a glutathione-S-transferase tag, starch-binding 
1 5 domain, a FLAG-tag, and the like. One of skill in the art will understand that any affinity tag 
capable of being used with a vector of the present invention will be useful in methods of the 
invention. Further, the skilled artisan will also appreciate that a single vector of the invention 
may comprise more than one affinity tag, and that multiple affinity tags may be identical or 
may be heterogeneous in sequence. 

20 [0157] The present invention also features a method of providing a protein to a customer, 
wherein an expression vector used to express a protein has multiple characteristics as 
described elsewhere herein. For example, a method of the present invention for providing a 
protein to a customer comprises the cloning of a nucleic acid encoding the protein into an 
expression vector, wherein the expression vector comprises a kanamycin resistance marker 

25 and a highly- functional multiple cloning site. 

[0158] In an embodiment of the invention, a method of providing a protein to a customer 
comprises the cloning of a protein into the expression vector set forth in SEQ ID NO:l . The 
protein-SEQ ID NO:l construct is transformed into an E. coli cell, the protein is expressed 
therefrom, and the protein product is provided to the customer. In one aspect of the 
30 invention, the E. coli cell is a JM109 cell. In another aspect of the invention, the protein is a 
therapeutic protein. In yet another aspect of the protein, the protein is a reagent protein. 
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[0159] In another embodiment of the invention, a method of providing a protein to a 
customer comprises the cloning of a protein into the expression vector set forth in SEQ ID 
NO:2. The protein-SEQ ID NO:2 construct is transformed into an E. coli cell, the protein is 
expressed therefrom, and the protein product is provided to the customer. In one aspect of 
5 the invention, the E. coli cell is a JM109 cell. In another aspect of the invention, the protein 
is a therapeutic protein. In yet another aspect of the protein, the protein is a reagent protein. 

[0160] In another embodiment of the invention, a method of providing a protein to a 
customer comprises the cloning of a protein into the expression vector set forth in SEQ ID 
NO:3. The protein-SEQ ID NO:3 construct is transformed into an E. coli cell, the protein is 
10 expressed therefrom, and the protein product is provided to the customer. In one aspect of 
the invention, the E. coli cell is a JM109 cell. In another aspect of the invention, the protein 
is a therapeutic protein. In yet another aspect of the protein, the protein is a reagent protein. 

[0161] In still another embodiment of the invention, a method of providing a protein to a 
customer comprises the cloning of a protein into the expression vector set forth in SEQ ID 
15 NO: 10. The protein-SEQ ID NO: 10 construct is transformed into an E. coli cell, the protein 
is expressed therefrom, and the protein product is provided to the customer. In one aspect of 
the invention, the E. coli cell is a JM109 cell. In another aspect of the invention, the protein 
is a therapeutic protein. In yet another aspect of the protein, the protein is a reagent protein. 

[0162] In another embodiment of the invention, a method of providing a protein to a 
20 customer comprises the cloning of a protein into the expression vector set forth in SEQ ID 
NO:l 1. The protein-SEQ ID NO:l 1 construct is transformed into an E. coli cell, the protein 
is expressed therefrom, and the protein product is provided to the customer. In one aspect of 
the invention, the E. coli cell is a JM109 cell. In another aspect of the invention, the protein 
is a therapeutic protein. In yet another aspect of the protein, the protein is a reagent protein. 

25 [0163] The present invention features a method of providing a protein to a customer, 

wherein a nucleic acid encoding a protein is cloned into an expression vector of the invention 
by the party providing a vector to a recipient. That is, a nucleic acid encoding a protein is 
cloned into an expression vector of the invention before the vector is transferred to a 
recipient. In an embodiment of the invention, a method of providing a protein to a customer 

30 comprises providing a vector to a recipient, wherein the vector contains a nucleic acid 

encoding a protein. The recipient of the vector expresses the protein, and the protein is then 
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provided to a customer. In one aspect of the invention, a recipient is a protein production 
facility. 

[0164] In an embodiment of the invention, a method of providing a protein to a customer 
comprises providing a vector to a recipient, wherein the vector does not contain a nucleic acid 
5 encoding a protein. The recipient of the vector clones a nucleic acid encoding a protein into 
the vector and expresses the protein, and the protein is then provided to a customer. In one 
aspect of the invention, a recipient is a protein production facility. In another aspect of the 
invention, the nucleic acid cloned into a vector is provided by the party providing the vector 
to the recipient. In yet another aspect of the invention, the nucleic acid cloned into a vector is 
1 0 provided by the recipient. 

[0165] The invention also features a method of providing a protein to a customer, wherein 
the method comprises providing a vector to a recipient, wherein the vector comprises a 
nucleic acid encoding a protein, for the purpose of expression of the protein encoded by the 
vector. In one embodiment of the invention, the recipient is a protein production facility. In 
15 one aspect, the protein production facility is in-house. By way of non-limiting examples, such 
recipients include an in-house protein production facility and an in-house laboratory. In 
another aspect of the invention, the protein production facility is offsite. By way of non- 
limiting examples, such recipients include an offsite protein production facility, an offsite 
laboratory, an offsite biotechnology company and an offsite pharmaceutical company. 

20 [0166] A protein produced using a method of the present invention may be provided to a 

customer by the recipient of the vector, wherein the recipient is responsible for expressing the 
protein from the vector provided by another party. Alternatively, a protein produced using a 
method of the present invention may be provided to a customer by the original party that 
provided the vector to the recipient, wherein the recipient expresses the protein and provides 

25 the resulting protein product to the original party so that the original party may provide the 
protein to a customer. 

[0167] A protein produced using a method of the present invention may be provided to a 
customer in the form of a purified protein, a partially purified protein, an isolated protein, a 
partially isolated protein, a bacterial cell lysate, cell paste or purified inclusion bodies. It will 
30 be understood that a protein produced using a method of the present invention may be 

provided to a customer in any form known in the art to be useful for the storage, transfer, or 
processing of a recombinant protein. 
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[0168] The present invention also features a method of providing a protein to a customer, 
wherein at least one glycosyl moiety is added to the protein before providing the protein to 
the customer. A glycosyl moiety may be added to a protein using any method known in the 
art. Additionally, a glycosyl moiety may be added to a protein of the invention using any one 
5 of the methods or reagents taught by DeFrees et al. in PCT Application WO 03/03 1464, 

which is incorporated herein by reference in its entirety. The skilled artisan will understand, 
based on the disclosure herein, that any of the methods known in the art or set forth herein are 
useful for glycosylating a protein of the present invention prior to providing the protein to a 
customer. 

10 [0169] Thus, in an embodiment of the present invention, a method of providing a protein to 
a customer comprises providing a vector comprising a nucleic acid encoding a protein to a 
recipient, wherein the recipient expresses the protein, and further wherein the protein is 
modified with at least one glycosyl moiety before the protein is provided to a customer. In 
one aspect of the invention, at least one glycosyl moiety is added to the protein by the 

1 5 recipient before providing the protein to a customer. In another aspect of the invention, at 

least one glycosyl moiety is added to the protein by the recipient before providing the protein 
to the original supplier of the vector, wherein the original supplier of the vector subsequently 
provides the protein to a customer. In yet another aspect of the invention, the recipient 
provides the expressed protein to the original supplier of the vector, wherein the original 

20 supplier of the vector adds at least one glycosyl moiety to the protein before providing the 
protein to a customer. 

[0170] In another embodiment of the invention, a method of providing a protein to a 
customer comprises providing a vector to a recipient, wherein the recipient clones a nucleic 
acid encoding a protein into the vector, expresses the protein, and further wherein the protein 

25 is modified with at least one glycosyl moiety before the protein is provided to a customer. In 
one aspect of the invention, at least one glycosyl moiety is added to the protein by the 
recipient before providing the protein to a customer. In another aspect of the invention, at 
least one glycosyl moiety is added to the protein by the recipient before providing the protein 
to the original supplier of the vector, wherein the original supplier of the vector to the 

30 recipient provides the protein to a customer. In yet another aspect of the invention, the 
recipient provides the expressed protein to the original supplier of the vector, wherein the 
original supplier of the vector adds at least one glycosyl moiety to the protein before 
providing the protein to a customer. 
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EXPERIMENTAL EXAMPLES 
[0171] The invention is now described with reference to the following examples. These 
examples are provided for the purpose of illustration only and the invention should in no way 
be construed as being limited to these examples but rather should be construed to encompass 
5 any and all variations which become evident as a result of the teaching provided herein. 

Example 1 : Modification of pCWori+ Amp r expression vector by disrupting the Amp r gene 
and adding the kanamycin resistance gene 

[0172] The pCWori+ Amp r vector contains an ampicillin resistance marker, as well as the 
genes encoding N. Meningitidis CMP -NAN synthetase (CNS) and Campylobacter Jejuni 
10 a2,3 Sialyl Transferase (CstI), referred to as Cst-04 (Cst-04 was provided by Warren 

Wakarchuck, National Research Council, Canada). This example describes the complete 
process by which the Cst-04 (pCWori+ amp r -CNS-CstI) plasmid was interrupted at the Pvul 
and Seal sites of ampicillin gene by the insertion of the kanamycin resistance gene. 

[0173] A kanamycin resistance gene was isolated from pGEX-Kt-ext Kan r using PCR to 

1 5 generate cDNA with modified restriction sites at 5' (Pvul- 

ATTCCAATTCGATCGGGGGGGGGGGGAAA) (SEQ ID NO:4) and 3' (Scal- 
ATTCCAAGTAGTACTTTAGAAAAACTCATCG) (SEQ ID NO:5) ends. The PCR product 
was then subcloned into a Cst04 (pCWori+ amp r -CNS-CstI) vector in TGI cells. A colony 
positive for the recombinant vector (Cst-04Kan5) was identified, and the Cst-04Kan5 plasmid 

20 was isolated, then transformed into JM 109 cells. 

[0174] A PCR reaction was conducted containing 1 ng pGEXKan r template, 1 jag (1 jal) 
kan-Scal/Pvul primer, 77 pi H 2 0, 8 jal dNTP mixture, 10 pi 1 OX buffer, and 1 pi Vent 
polymerase. The reaction parameters included a 5 minute cycle at 95 °C, followed by the 
addition of 1 pi of Vent polymerase and thirty cycles of the following temperature pattern: 94 
25 °C for one minute, 55 °C for one minute, 72 °C for one minute. 

[0175] The PCR product and the Cst-04 vector were subjected to a restriction digest. The 
PCR product digest included 16 pi PCR rxn, 2 pi 1 OX buffer, 0.5 pi Pvul, 0.5 pi Seal, 1 pi 
H 2 0. The pGEX-Ktext Kan r vector digest included 1 pi pGEX-Ktext Kan r vector, 2 pi 1 OX 
buffer, 0.5 pi Pvul, 0.5 pi Seal, 1 pi H 2 0. Both digests were incubated at 37 °C for 3 hours. 
30 Both the digested PCR fragment and the digested vector DNA were purified from 0.8% TAE 
agarose gels (Figure 1 A). 
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[0176] The PCR product was then ligated into Cst-04 vector. The ligation reaction 
contained 7 |il gel-purified Kan r gene (cut with Sacl/Xbal), 1 \il gel-purified Cst-04 vector 
(cut with BamHI/EcoRI), 1 jul 10X Ligation Buffer 1 ^1 T4 DNA ligase, and was incubated 
on ice overnight. The ligated PCR product was then transformed into the TGI competent 
5 cells. The transformation reaction conditions included 500 |al (thawed on ice) TGI 

Competent cells and 5jli1 pGEX-KT-exT-kan r - CNS-CstI ligation rxn. The cell/DNA mixture 
was incubated on ice for 30 minutes, and the cells were heat shocked at 42°C for 45 seconds 
and then incubated on ice again for 2 minutes. 500 jal LB broth was added and the mixture 
was shaken at 37°C for 1 hour. The transformation reactions were then plated on LB Kan r 
10 plates and incubated 37°C overnight. The results of the transformation reactions are shown in 
Figure IB. 

[0177J Positive clones were screened for using the following method. Two milliliters of 9x 
LB/Kanamycin (lOjag/ml) culture was incubated with individual transformants at 37°C o/n 
using 250 RPM shaking. 1 .5 milliliters of the overnight culture was transferred to an 
1 5 eppendorf tube to isolate plasmid DNA using Wizard Mini-Prep Kit (Qiagen, Valencia, CA). 
An insert-containing colony (Cst-04-Kan5, Figure 1C) was expanded in 100 ml of LB culture 
in order to isolate more plasmid DNA. 

[0178] The Cst-04-Kan5 plasmid-containing colony was screened for kanamycin and 
ampicillin resistance. Cst-04-Kan5 colony was streaked on both AFLB Kan r and Amp r 
20 plates, which were incubated overnight at 37 °C. Figure ID shows that, in colony Cst- 
04Kan5, the kanamycin resistance gene is active and the Ampicillin resistance gene is 
inactive. 

[0179] The Cst-04Kan5 plasmid was transfomed into E. coli JM109 cells using 20 jul 
JM109 competent cells in 0.34 jal 1.42 M beta mercaptoethanol, incubated on ice for 10 

25 minutes, then adding 1 jal (100 ng) Cst-04-Kan5 plasmid. The cell/DNA mixture was 

incubated ice for 30 minutes, then heat shocked at 42 °C for 45 seconds. The reaction was 
then incubated on ice for 2 minutes, at which time 80 jal SOC or LB was added. The reaction 
mixture was shaken at 37°C for 1 hour, then plated on LB Kan 1 * plates. Identification and 
confirmation of the Cst-04-Kan5 plasmid DNA was carried out with a restriction enzyme 

30 digestion of plasmid DNA isolated from positive transformants, using Ndel, Sail, PstI 

restriction enzymes. The restriction fragment sizes were -7.3 kb for one cut, such as Ndel or 
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Sail. Three bands (~1.7kb, ~2.2kb, and ~3.2kb) were observed when Cst-04-Kan5 DNA was 
cut with PstI (See Figure 1C, lane 4). 

[0180] Starter cultures of Cst-04Kan5 plasmid-containing cells were produced and used to 
inoculate 1 00ml cultures for the generation of cell lysates. Centrifuged cell pellets resulting 
5 from the large-scale cultures were resuspended in 5ml H2O prior to lysis in a French press. 
The resultant lysate was centrifuged at 4°C, 18,000 RPM for 20 minutes. The clarified lysate 
was subsequently used for activity analysis and the remainder of the lysate was stored at - 
20°C. 

[0181] The activity of the cell lysates was determined under the assay conditions 
10 illustrated in Table 1. 
Table 1. 





Final 


Stock 




Reagent 


Concentration 


Concentration 


Amounts 


CTP 


1 mM 


100 mM 


lul 


NAN 


1 mM 


200 mM 


0.5 ul 


LacPsy 


0.5 mM 


2.5 mM 


20 ul 


MgC12 


50 mM 


1M 


5 ul 


Tris pH 8 


lOOmM 


1M 


10 ul 


Lysate 


15% 


Crude 


15 |al 


dH20 






63 ul 


Total 








reaction 






100 ul 



volume 

[0182] Table 1 lists the reagents, and concentrations and volumes thereof, used in the lysate 
activity assays. 

[0183] The lysate assay reagents were mixed and incubated at 37°C. Time points were 
15 taken at 0 minutes and 1 hour. A negative control (pGEX-Kt-exT-kan r vector without insert) 
was also included. All time points were analyzed using thin layer chromatography 
(CHCl3:CH 3 OH:H2O:NH 4 OH:60:40:5:l respectively). The plates were air dried, dipped in 
anisaldehyde and heated on a hot plate until the spots developed. Cst-04Kan5 lysate was also 
assayed for activity using lacto-N-neotetraose as substrate. The lacto-N-neotetraose substrate 
20 activity is illustrated in Figure IE. Activity of lysates from Cst-04Kan5 plasmid-containing 
cells was 8500 units/liter. 
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Example 2: Modification of the polylinker of pCWori+ Kan r expression vector. 
[0184] pCWori+ Kan r (Cst-04Kan5) contains the genes encoding N. meningitidis CMP- 
NAN synthetase (CNS) and Campylobacter jejuni a2,3 Sialyl Transferase (CstI) at the 
multiple cloning site. This vector appears to give high levels of expression of recombinant 
5 proteins but is hard to use due to the limited number of restriction sites in the multiple 
cloning site (MCS). Therefore, the multiple cloning site was modified and expanded as 
described herein. 

[0185] The multiple cloning site starting at Ndel restriction site and extending to the start 
of the inactive Amp r gene from Cst04Kan5 was generated using PCR to generate cDNA with 

10 modified multiple cloning sites at 5' (Pcwmcs (Ndel) - 

ATCGATCGACATATGGGATCCGAGCTCAAGCTTTCTAGACTCGAGGAATTCGGTA 
CCGTCGACATCGATGATAAGCTGTCAAA) (SEQ ID NO:6) and 3' (Seal- 
ATTCCAAGTAGTACTACTCTTCCTTTTTCAA) (SEQ ID NO:7) ends of pCWINl 
construct. The PCR primers set for pre pCWIN2 construct are 5' (Bgl II- 

15 CAATTATATAGATCTATCGATGCTTAGGAGGT) (SEQ ID NO:8) and 3' (Cstl-Xba- 
TTGCCTTATTCTAGATCATTAGTGGTGATGGTGGTG) (SEQ ID NO:9). The PCR 
products were then subcloned into Cst04kan5 (pCWoriH- kan r -CNS-CstI) vector, transformed 
into TGI cells, and screened for the correct construct. 

[0186] Two PCR reactions were conducted, using 10 ng Cst04Kan5 cDNA as a template. 

20 The first reaction contained 1 jxg (1 ^1) Pcwmcs/Scal-pcw primer, 78 \x\ H 2 0, 8 jal dNTP 

mixture, 10 \il 10X buffer, and 1 \x\ Vent polymerase. The second reaction contained 1 \ig (1 
jal) 5'pcBglII/Cstl-Xba primer, 78 \x\ H 2 0, 8 \i\ dNTP mixture, 10 \x\ 10X buffer, and 1 \x\ 
Vent polymerase. The PCR reaction parameters included a 5 minute cycle at 95 °C, followed 
by the addition of 1 \x\ of Vent polymerase and thirty cycles of the following temperature 

25 pattern: 94 °C for one minute, 55 °C for one minute, 72 °C for one minute. 

[0187] The PCR products were subjected to a restriction digest. The first PCR reaction 
product ("pCWINl" insert) digest included 16 \x\ PCR rxn, 2 \i\ 10X buffer, 0.5 \i\ Ndel, 0.5 
|lx1 Seal, 1 \i\ H 2 0, and the second PCR reaction product ("pre pCWIN2" insert) digest 
included 16 jal PCR rxn, 2 \A 10X buffer, 0.5 \i\ Bglll, 0.5 \x\ EcoRI, 1 \x\ H 2 0. A pCWori 
30 Kan r Cst04Kan5 vector was prepared for insertion of the first PCR product by incubation of 2 
\l\ (1 |ig) Cst04Kan5 vector, 2 \x\ 10X buffer, 0.5 \x\ Ndel, 0.5 \i\ Seal, 1 ^1 H 2 0. A pCWori 
Kan r Cst04Kan5 vector was similarly prepared for insertion of the second PCR product by 
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incubation of 2 jil (1 jag) Cst04Kan5 vector, 2 j^l 10X buffer, 0.5 |il BamHI, 0.5 jal EcoRI, 1 
|il H 2 0. 

[0188] The digested PCR fragments and digested vectors were purified from 0.8% TAE 
agarose gels (Figure 2A). The PCR products were then subcloned into Cst-04kan5 vectors by 
ligation (Table 2) and electroporated into the TGI competent cells. Electroporation reactions 
included 30 jal thawed (on ice) TGl/DH5a electrocompetent cells and 3 |lx1 ligation reaction 
mixture. The DNA/cell electroporation mixture was transferred to a chilled cuvette, and the 
cells were subjected to electroporation using pulses of 2.5 KV, R5 resistance, and 129 ohms. 
0.9 ml SOC media was then added to the reaction mixture, and the entire culture was 
incubated at 37 °C for one hour, at which time the electoporation product was incubated 
overnight after plating on LB agar plates containing 50 |ag/ml Kan r . 



Table 2 

1. pCWINl 

Gel-purified pCWINl insert (digested A) 7 jil 

Gel-purified Cst-04kan5 vector (digested B) 1 jul 

10X Ligation Buffer 1 jil 

T4 DNA Ligase 1 ^1 

2. pre pCWIN2 

Gel-purified pre pCWIN2 insert (digested C) 7 jal 

Gel-purified Cst-04kan5 vector (digested D) 1 jil 

10X Ligation Buffer 1^1 

T4 DNA Ligase 1 jil 

3. pCWIN2 

Gel-purified pCWINl #5 insert (digested E) 7 |il 

Gel-purified pre pCWIN2#l 1 vector (digested F) 1 pi 

10X Ligation Buffer ~ 1 jul 

T4 DNA Ligase 1 M l 



[0189] Table 2 illustrates the ligation reaction conditions for pCWINl, pre-pCWIN2, and 
pCWIN2 PCR reaction products. Both pCWINl and pre pCWIN2 ligations were incubated 
at 4°C overnight. 

[0190] Screening of transformants for positive clones was then conducted. Five colonies 
pCWINl -containing TGI cells were selected, as were 9 colonies of pre-pCWin2 in DH5oc, 
and 18 colonies of pCWin2 in TGI. Each was placed into 2 ml TB/kanamycin (50|ag/ml) and 
incubated at 37°C for 5 hours with shaking at 250 RPM. 1.5 milliliters of each culture was 
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transferred to an eppendorf tube to isolate plasmid DNA using Wizard Plus Mini-Prep Kit 
(Qiagen, Valencia, CA). Each plasmid DNA preparation was subjected to restriction 
digestion with the appropriate restriction endonucl eases as described above. The digestion 
reactions were then analyzed on agarose/TAE gels (Figures 2B - 2D). 

5 [0191] One colony of pCWinl (colony #5) and one colony of pre pCWin2 (colony #1 1) 
contained the appropriate sized inserts. Specifically, the expected size for the pCWinl insert 
was 5 kb and the expected size for the pre-pCWin2 insert was 7 kb (Figure 2B). pCWinl and 
pre pCWin2 plasmid DNA were digested with Ndel and Seal. Ndel and Seal digestion of 
pCWinl plasmid generated bands of 750 bp and 4.2kb, and the same digestion of pre- 

1 0 pCWin2 plasmid generated bands of 2.8 kb and 4.2 kb (Figure 2C). Figure 2D illustrates the 
result of ligating the 750 bp of pCWinl with the 4.2 kb fragment of pre-pCWin2 to generate 
the pCWin2 expression vector. The difference between pCWinl and pCWin2 expression is 
that pCWinl has two BamHl sites, one in being in the tac promoter and the other being in the 
multiple cloning site (down stream of Ndel). pCWin2 has only one BamHl site, in the 

15 multiple cloning site, and the BamHl that resided in the tac promoter was destroyed. 

Example 3: Addition of Maltose Binding Protein to pCWin2 Kan r Expression Vector 
[0192] The E.coli malE gene, encoding a maltose binding protein, was subcloned into the 
pCWin2 kan r bacterial expression vector. The malE gene was PCR amplified from pMal- 
c2X, ligated into the multiple cloning site of pCWin2 kan r , and subsequently transformed into 
20 electrocompetent DH5a E. coli. The final product, a pCWin2MBP kan r bacterial fusion tag 
expression vector, was created as described below. 

[0193] Restriction endonuclease digestion of pCWin2 kan r and pMAL-c2X amp r was 
conducted to prepare the malE maltose binding protein cDNA and the pCWin2 vector cDNA 
for insertion of the malE cDNA into the pCWin2 vector. Digestion of the malE cDNA was 
25 conducted using 2 \i\ of pMAL-C2X vector DNA (l^ig/jul), 2 \i\ 1 OX BamHl NEbuffer, 2 |nl 
10X purified BSA, 1 jal Ndel, 1 |j.l BamHl, and 12 \i\ dH 2 0. Digestion of the vector was 
conducted using 2 \i\ pCWin2 vector DNA 0.8|ig/^l, 2 |li110X BamHl NEbuffer, 2 \i\ 10X 
purified BSA, 1 \i\ Ndel, 1 \i\ BamHl, and 12 \x\ dH 2 0. 

[0194] The restriction enzyme digestions were incubated at 37°C for two hours. The 
30 reactions were stopped by adding 3 \i\ Blue/Orange 6x Agarose Loading Dye. The digestions 
were then loaded onto separate 0.7% agarose/TAE gels, and electrophoresed at 135 volts 
until the dyes migrated to the lower third of the gel. An image of the pCWin2 vector 



41 



digestion agarose gel was captured using a digital camera (Figure 3B). An image of the 
polyacrylamide gel containing the purified product from the digestion of malE is shown in 
Figure 3 A. 

[0195] The linearized pCWin2 kan r and malE fragments were gel purified. Using the UV 
5 box to illuminate the DNA, the bands of DNA were excised using a sterile scalpel. The 
pCWin2 kan r DNA was approximately 5 kb, and the malE gene was approximately 1 .2 kb. 
The excised agarose wedges were placed into Ultrafree DA agarose extraction spinfilters 
(Millipore, Belled ca, MA), and microcentrifuged at 10,000 xg for 5 minutes. The filtrates 
were transferred to YM-100 spinfilters (Millipore, Bellirica, MA), and the DNA was washed 
10 by adding 300 \x\ dH 2 0. The spinfilters were centrifuged at 500 xg for 15 minutes, and the 

wash step was repeated two additional times. The last wash step concentrated the DNA to an 
approximate volume of 25 jil, at which time the column was inverted into another autoclaved 
1 .7 mL eppendorf tube. The DNA retentate was collected by microcentrifuging the 
eppendorf at lOOOxg for one minute. 

1 5 [0196] The ligation of gel purified malE and linearized pCWin2 kan r was performed in an 
autoclaved 0.5 mL eppendorf microcentrifuge tube. The ligation reaction included 7 \x\ of 
purified malE DNA that was digested with Ndel and BamHI, 1 ^1 of linearized pCWin2 kan r 
that was digested with Ndel and BamHI, 1 ^1 10X ligase buffer, 1 |il T4 DNA ligase. The 
ligation reaction was incubated at room temperature for three hours. In the vector control 

20 ligation reaction, dH 2 0 was substituted for the malE DNA. 

[0197] The ligation reactions were transformed into electrocompetent DH5a E.coli. After a 
three hour ligation incubation, one aliquot of electrocompetent DH5a E.coli was removed 
from a -81°C freezer, and placed on ice to thaw. 20 jlxI of the cells was aliqouted into chilled, 
autoclaved 1.7 mL microcentrifuge tubes, and then one microliter from each ligation reaction 

25 was added to the cells. Immediately, the reactions were transferred to chilled electroporation 
cuvettes. The cells were electroporated with a 2.5kV 6 msec pulse as described in the 
manufacturer's instructions. Then one milliliter of AFLB SOC media was added to the 
transformation reactions, and the entire volume was transferred to an autoclaved 1 .7 mL 
microcentrifuge tube. The transformation reactions were incubated at 37°C for one hour with 

30 shaking at 250 rpm. After incubating the cells for an hour, 100 |al from each transformation 
reaction was plated by spreading onto LB agar kan r plates. The plates were incubated at 37°C 
overnight. 
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[0198] Results from the ligation and transformation reactions are as follows. The pCWin2 
MBP vector plating resulted in thirteen colonies, and out of ten colonies selected, nine were 
positive for the recombinant vector. The pCWin2 vector control plating did not contain any 
colonies. Colonies from the pCWin2 MBP vector LB agar plates were selected and used to 
5 inoculate 2 ml AFLB kan r . The starter cultures were grown overnight at 37°C, with shaking 
at 250 rpm. Plasmid DNA was isolated from the transformants and screened for the correct 
malE insert by a double digestion with Ndel and BamHI restriction endonucleases. 

[0199] Restriction digestion of miniprep DNA was conducted in a reaction mixture 
containing 12 |j,l Miniprep DNA, 1 ^1 Ndel endonuclease, 1 jul BamHI endonuclease, 1.5 ^il 

10 lOXBamHI NEBuffer, and 1 \i\ 10X purified BSA. The digestion reactions were mixed and 
incubated at 37°C for one hour. After incubation, 3 jil of 6x Agarose Gel Loading Dye was 
added to each restriction digestion. The restriction digestions were then loaded into the wells 
of a 0.7% agarose/TAE gel. The samples were electrophoresed at 135 volts until the dye 
migrated to the lower third of the gel. The gel was then removed from the gel box, and the 

15 image captured with a digital camera (Figure 3C). 

[0200] Large scale purified pCWin2MBP vector DNA was isolated from transformant #1 
using the HiSpeed Plasmid Maxi Kit (Qiagen, Valencia, CA). A 2 mL AFLB kan r starter 
culture was inoculated with 10 jj.1 pCWin2MBP DH5a E.coli overnight culture. This starter 
culture was grown overnight in a 37°C incubator, with shaking at 250 rpm. The overnight 
20 starter culture was used to inoculate two 125 mL AFLB kan r cultures, and these larger scale 
preps were grown overnight at 37°C with shaking at 250 rpm. DNA from the large scale 
preparation was used for sequencing of the malE insert subcloned between the Ndel and 
BamHI restriction sites (MWG Biotech, High Point, NC). The sequence of the vector is set 
forth in SEQ ID NO:3. 

25 Example 4: Preparation and Characterization of pCWin2-MBP-SBDWt>MSW) Vector 

[0201] The pMS 39 Kan R expression vector was created from the pCWIN2-MBP- SBD-ST3 
Gal III (Gal61,3(4)GlcNAc a2,3-Sialyltransferase) A73 construct, removing the the ST3 Gal 
III A73 gene and replacing it with the Multiple Cloning site (MCS) from the pcWIN2 vector. 
Selection was of final construct was determined by restriction enzyme analysis with Nde I 

30 and Xba I (there is no Xba I site the pCWIN2-MBP-SBD ST3 Gal III construct) digestion 
and sequence confirmation. The final construct was designated the pCWin2-MBP-SBD 
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(PMS39) Kan r expression vector. The several steps of the preparation of this vector are 
detailed in Figures 7A-7G. 

Example 5: Preparation and Characterization of the pCWin2-MBP-MCS-SBD (pMXS^o) 
Vector 

5 [0202] The pCWin2-MBP-MCS-SBD (pMXS 39 ) expression vector was constructed 

according to the following method. The Starch Binding Domain (SBD) insert was isolated 
by PCR using the 5' primer ( XhoI-SBD-39-5' 

TGTATCCTCGAGATTGTGGCGACCGGCGGCACCAC) (SEQ ID NO: 12) and the 3' 
primer (3' Sall-AAGCTTGTCGACTCATTAGCGCCAGGTATCGGTCACGG) (SEQ ID 
10 NO: 13). The PCR products were gel purified and ligated into PCR-Blunt vector. The correct 
SBD insert (in the PCR-Blunt vector) was digested with Xhol and Sail, subcloned into 
Xhol-Sall digested pCWin2-MBP kan r vector, transformed into TBI cells and screened for 
the correct construct. The several steps of the preparation of this vector are detailed in 
Figures 8A-8E. 

1 5 [0203] The disclosures of each and every patent, patent application, and publication cited 
herein are hereby incorporated herein by reference in their entirety. While the invention has 
been disclosed with reference to specific embodiments, it is apparent that other embodiments 
and variations of this invention may be devised by others skilled in the art without departing 
from the true spirit and scope of the invention. The appended claims are intended to be 

20 construed to include all such embodiments and equivalent variations. 
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